Dataset info
| Number of variables | 181 |
|---|---|
| Number of observations | 2000 |
| Missing cells | 240887 (66.5%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 2.7 MiB |
| Average record size in memory | 1.4 KiB |
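The missing-cell share and the average record size can be re-derived from the other figures in this table. A minimal check, assuming that a "cell" is one variable value in one observation and that sizes use binary units (1 MiB = 1024 KiB); the variable names are illustrative only:

```python
# Figures copied from the "Dataset info" table.
n_variables = 181
n_observations = 2000
missing_cells = 240887
total_size_mib = 2.7

# Assumption: a "cell" is one variable value in one observation.
total_cells = n_variables * n_observations
missing_share = missing_cells / total_cells

# Assumption: binary units, so 1 MiB = 1024 KiB.
avg_record_kib = total_size_mib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing share: {missing_share:.1%}")
print(f"average record size: {avg_record_kib:.1f} KiB")
```

Under these assumptions the results agree with the reported 66.5% and 1.4 KiB.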
Variable types
| Numeric | 47 |
|---|---|
| Categorical | 32 |
| Boolean | 27 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 1 |
| Rejected | 74 |
| Unsupported | 0 |
Warnings
coligada_mais_antiga_ativa has 1737 (86.9%) missing values | Missing |
coligada_mais_antiga_baixada has 1999 (> 99.9%) missing values | Missing |
coligada_mais_nova_ativa has 1737 (86.9%) missing values | Missing |
coligada_mais_nova_baixada has 1999 (> 99.9%) missing values | Missing |
de_faixa_faturamento_estimado has 118 (5.9%) missing values | Missing |
de_faixa_faturamento_estimado_grupo has 118 (5.9%) missing values | Missing |
de_indicador_telefone has 1804 (90.2%) missing values | Missing |
de_nivel_atividade has 51 (2.5%) missing values | Missing |
de_saude_rescencia has 64 (3.2%) missing values | Missing |
de_saude_tributaria has 64 (3.2%) missing values | Missing |
dt_situacao only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
dt_situacao has a high cardinality: 1315 distinct values | Warning |
empsetorcensitariofaixarendapopulacao has 600 (30.0%) missing values | Missing |
faturamento_est_coligados has 1738 (86.9%) missing values | Missing |
faturamento_est_coligados_gp is highly correlated with faturamento_est_coligados (ρ = 0.9509546939) | Rejected |
fl_epp has constant value "False" | Rejected |
fl_optante_simei has 346 (17.3%) missing values | Missing |
fl_optante_simples has 346 (17.3%) missing values | Missing |
fl_st_especial has constant value "False" | Rejected |
grau_instrucao_macro_analfabeto has 1994 (99.7%) missing values | Missing |
grau_instrucao_macro_desconhecido has constant value "nan" | Rejected |
grau_instrucao_macro_escolaridade_fundamental is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9980369909) | Rejected |
grau_instrucao_macro_escolaridade_media has 1694 (84.7%) missing values | Missing |
grau_instrucao_macro_escolaridade_superior is highly correlated with grau_instrucao_macro_escolaridade_media (ρ = 0.9178829036) | Rejected |
idade_acima_de_58 is highly correlated with grau_instrucao_macro_analfabeto (ρ = 1) | Rejected |
idade_ate_18 has 1990 (99.5%) missing values | Missing |
idade_de_19_a_23 has 1879 (94.0%) missing values | Missing |
idade_de_24_a_28 is highly correlated with idade_de_19_a_23 (ρ = 0.9086682511) | Rejected |
idade_de_29_a_33 is highly correlated with grau_instrucao_macro_escolaridade_superior (ρ = 0.9749169386) | Rejected |
idade_de_34_a_38 is highly correlated with idade_de_29_a_33 (ρ = 0.994053508) | Rejected |
idade_de_39_a_43 is highly correlated with idade_de_34_a_38 (ρ = 0.9834868533) | Rejected |
idade_de_44_a_48 is highly correlated with idade_de_39_a_43 (ρ = 0.9491747935) | Rejected |
idade_de_49_a_53 is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9891584832) | Rejected |
idade_de_54_a_58 is highly correlated with idade_de_49_a_53 (ρ = 0.9617506252) | Rejected |
idade_maxima_coligadas is highly correlated with coligada_mais_antiga_ativa (ρ = 0.9997565556) | Rejected |
idade_maxima_socios has 659 (33.0%) missing values | Missing |
idade_media_coligadas has 1737 (86.9%) missing values | Missing |
idade_media_coligadas_ativas is highly correlated with idade_media_coligadas (ρ = 0.9996384056) | Rejected |
idade_media_coligadas_baixadas has 1999 (> 99.9%) missing values | Missing |
idade_media_socios is highly correlated with idade_maxima_socios (ρ = 0.9554366317) | Rejected |
idade_minima_coligadas is highly correlated with coligada_mais_nova_ativa (ρ = 1) | Rejected |
idade_minima_socios is highly correlated with idade_media_socios (ρ = 0.9505781936) | Rejected |
max_faturamento_est_coligados is highly correlated with faturamento_est_coligados_gp (ρ = 0.9359933557) | Rejected |
max_faturamento_est_coligados_gp is highly correlated with max_faturamento_est_coligados (ρ = 0.9599434338) | Rejected |
max_filiais_coligados has 1919 (96.0%) missing values | Missing |
max_funcionarios_coligados_gp has 28 (1.4%) zeros | Zeros |
max_funcionarios_coligados_gp has 1830 (91.5%) missing values | Missing |
max_meses_servicos is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9664890124) | Rejected |
max_meses_servicos_all is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9664890124) | Rejected |
max_vl_folha_coligados has 1858 (92.9%) missing values | Missing |
max_vl_folha_coligados_gp is highly correlated with max_funcionarios_coligados_gp (ρ = 0.9048394021) | Rejected |
media_faturamento_est_coligados has 1738 (86.9%) missing values | Missing |
media_faturamento_est_coligados_gp is highly correlated with media_faturamento_est_coligados (ρ = 0.9503777089) | Rejected |
media_filiais_coligados has 1919 (96.0%) missing values | Missing |
media_funcionarios_coligados_gp has 28 (1.4%) zeros | Zeros |
media_funcionarios_coligados_gp has 1830 (91.5%) missing values | Missing |
media_meses_servicos is highly correlated with max_meses_servicos (ρ = 0.9868452786) | Rejected |
media_meses_servicos_all is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9765604493) | Rejected |
media_vl_folha_coligados has 1858 (92.9%) missing values | Missing |
media_vl_folha_coligados_gp is highly correlated with media_funcionarios_coligados_gp (ρ = 0.9041231022) | Rejected |
meses_ultima_contratacaco has 1533 (76.6%) missing values | Missing |
min_faturamento_est_coligados has 1738 (86.9%) missing values | Missing |
min_faturamento_est_coligados_gp has 1738 (86.9%) missing values | Missing |
min_filiais_coligados is highly correlated with min_faturamento_est_coligados_gp (ρ = 0.942216953) | Rejected |
min_funcionarios_coligados_gp is highly correlated with min_filiais_coligados (ρ = 0.9051143156) | Rejected |
min_meses_servicos is highly correlated with meses_ultima_contratacaco (ρ = 0.9357049853) | Rejected |
min_meses_servicos_all is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9830648574) | Rejected |
min_vl_folha_coligados has 1858 (92.9%) missing values | Missing |
min_vl_folha_coligados_gp is highly correlated with min_vl_folha_coligados (ρ = 0.9817628137) | Rejected |
nm_divisao has a high cardinality: 72 distinct values | Warning |
nm_meso_regiao has 278 (13.9%) missing values | Missing |
nm_micro_regiao has a high cardinality: 73 distinct values | Warning |
nm_micro_regiao has 278 (13.9%) missing values | Missing |
nu_meses_rescencia has 199 (10.0%) missing values | Missing |
percent_func_genero_fem is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.9457217032) | Rejected |
percent_func_genero_masc has 83 (4.2%) zeros | Zeros |
percent_func_genero_masc has 1659 (83.0%) missing values | Missing |
qt_admitidos has 1533 (76.6%) missing values | Missing |
qt_admitidos_12meses has 342 (17.1%) zeros | Zeros |
qt_admitidos_12meses has 1533 (76.6%) missing values | Missing |
qt_alteracao_socio_180d has constant value "nan" | Rejected |
qt_alteracao_socio_365d has constant value "nan" | Rejected |
qt_alteracao_socio_90d has constant value "nan" | Rejected |
qt_alteracao_socio_total has constant value "nan" | Rejected |
qt_art has 1974 (98.7%) missing values | Missing |
qt_coligadas has 1791 (89.5%) missing values | Missing |
qt_coligados is highly correlated with qt_coligadas (ρ = 0.9950754524) | Rejected |
qt_coligados_agropecuaria has 1737 (86.9%) missing values | Missing |
qt_coligados_atividade_alto has 1737 (86.9%) missing values | Missing |
qt_coligados_atividade_baixo has 1737 (86.9%) missing values | Missing |
qt_coligados_atividade_inativo has 1737 (86.9%) missing values | Missing |
qt_coligados_atividade_medio has 1737 (86.9%) missing values | Missing |
qt_coligados_atividade_mt_baixo has 1737 (86.9%) missing values | Missing |
qt_coligados_ativo is highly correlated with qt_coligados (ρ = 0.9922537325) | Rejected |
qt_coligados_baixada is highly correlated with min_vl_folha_coligados_gp (ρ = 0.9776137125) | Rejected |
qt_coligados_ccivil has 212 (10.6%) zeros | Zeros |
qt_coligados_ccivil has 1737 (86.9%) missing values | Missing |
qt_coligados_centro has 255 (12.8%) zeros | Zeros |
qt_coligados_centro has 1737 (86.9%) missing values | Missing |
qt_coligados_comercio has 144 (7.2%) zeros | Zeros |
qt_coligados_comercio has 1737 (86.9%) missing values | Missing |
qt_coligados_epp has 1737 (86.9%) missing values | Missing |
qt_coligados_exterior has 1737 (86.9%) missing values | Missing |
qt_coligados_inapta has 1737 (86.9%) missing values | Missing |
qt_coligados_industria has 229 (11.5%) zeros | Zeros |
qt_coligados_industria has 1737 (86.9%) missing values | Missing |
qt_coligados_ltda has 1737 (86.9%) missing values | Missing |
qt_coligados_matriz is highly correlated with qt_coligados_ativo (ρ = 0.9940913825) | Rejected |
qt_coligados_me has 1737 (86.9%) missing values | Missing |
qt_coligados_mei has 1737 (86.9%) missing values | Missing |
qt_coligados_nordeste has 93 (4.7%) zeros | Zeros |
qt_coligados_nordeste has 1737 (86.9%) missing values | Missing |
qt_coligados_norte is highly correlated with idade_ate_18 (ρ = 0.9001915505) | Rejected |
qt_coligados_nula has 1737 (86.9%) missing values | Missing |
qt_coligados_sa is highly correlated with qt_coligados_matriz (ρ = 0.9176972372) | Rejected |
qt_coligados_serviço has 96 (4.8%) zeros | Zeros |
qt_coligados_serviço has 1737 (86.9%) missing values | Missing |
qt_coligados_sudeste is highly correlated with qt_coligados_sa (ρ = 0.9150368291) | Rejected |
qt_coligados_sul has 258 (12.9%) zeros | Zeros |
qt_coligados_sul has 1737 (86.9%) missing values | Missing |
qt_coligados_suspensa has 1737 (86.9%) missing values | Missing |
qt_desligados has 54 (2.7%) zeros | Zeros |
qt_desligados has 1533 (76.6%) missing values | Missing |
qt_desligados_12meses has 332 (16.6%) zeros | Zeros |
qt_desligados_12meses has 1533 (76.6%) missing values | Missing |
qt_ex_funcionarios is highly correlated with qt_desligados (ρ = 0.9999961944) | Rejected |
qt_filiais is highly skewed (γ1 = 27.83323791) | Skewed |
qt_filiais has 1818 (90.9%) zeros | Zeros |
qt_funcionarios is highly correlated with idade_de_44_a_48 (ρ = 0.9521282594) | Rejected |
qt_funcionarios_12meses is highly correlated with qt_funcionarios (ρ = 0.9870025118) | Rejected |
qt_funcionarios_24meses is highly correlated with qt_funcionarios_12meses (ρ = 0.9812462049) | Rejected |
qt_funcionarios_coligados is highly correlated with qt_coligados_sudeste (ρ = 0.9073230395) | Rejected |
qt_funcionarios_coligados_gp is highly correlated with max_funcionarios_coligados_gp (ρ = 0.9739337033) | Rejected |
qt_funcionarios_grupo is highly correlated with qt_filiais (ρ = 0.9003805679) | Rejected |
qt_ramos_coligados is highly correlated with qt_coligadas (ρ = 0.9442422997) | Rejected |
qt_regioes_coligados has 1737 (86.9%) missing values | Missing |
qt_socios has 502 (25.1%) missing values | Missing |
qt_socios_coligados is highly correlated with qt_funcionarios_coligados (ρ = 0.9113228208) | Rejected |
qt_socios_feminino has 1377 (68.8%) missing values | Missing |
qt_socios_masculino has 1139 (57.0%) missing values | Missing |
qt_socios_pep is highly correlated with qt_socios_masculino (ρ = 0.9867818247) | Rejected |
qt_socios_pf is highly correlated with qt_socios_pep (ρ = 0.9309382851) | Rejected |
qt_socios_pj has 502 (25.1%) missing values | Missing |
qt_socios_pj_ativos is highly correlated with qt_socios_pj (ρ = 1) | Rejected |
qt_socios_pj_baixados has 1981 (99.1%) missing values | Missing |
qt_socios_pj_inaptos has 1981 (99.1%) missing values | Missing |
qt_socios_pj_nulos has 1981 (99.1%) missing values | Missing |
qt_socios_pj_suspensos has 1981 (99.1%) missing values | Missing |
qt_socios_st_regular is highly correlated with qt_socios_pf (ρ = 0.9649092969) | Rejected |
qt_socios_st_suspensa has 1986 (99.3%) missing values | Missing |
qt_ufs_coligados has 1737 (86.9%) missing values | Missing |
sum_faturamento_estimado_coligadas is highly correlated with media_faturamento_est_coligados_gp (ρ = 0.9984290135) | Rejected |
total is highly correlated with qt_funcionarios_24meses (ρ = 0.9914053461) | Rejected |
total_filiais_coligados is highly correlated with qt_socios_coligados (ρ = 0.9651560413) | Rejected |
tx_crescimento_12meses has 210 (10.5%) zeros | Zeros |
tx_crescimento_12meses has 1658 (82.9%) missing values | Missing |
tx_crescimento_24meses has 131 (6.6%) zeros | Zeros |
tx_crescimento_24meses has 1646 (82.3%) missing values | Missing |
tx_rotatividade has 362 (18.1%) zeros | Zeros |
tx_rotatividade has 1533 (76.6%) missing values | Missing |
vl_faturamento_estimado_aux is highly correlated with total (ρ = 0.9849750322) | Rejected |
vl_faturamento_estimado_grupo_aux is highly correlated with qt_socios_st_suspensa (ρ = 0.9487115171) | Rejected |
vl_folha_coligados is highly correlated with qt_socios_pep (ρ = 0.9470885171) | Rejected |
vl_folha_coligados_gp is highly correlated with vl_folha_coligados (ρ = 0.9239805026) | Rejected |
vl_frota has 1885 (94.2%) missing values | Missing |
vl_idade_maxima_socios_pj has 1981 (99.1%) missing values | Missing |
vl_idade_media_socios_pj is highly correlated with vl_idade_maxima_socios_pj (ρ = 0.9804465153) | Rejected |
vl_idade_minima_socios_pj is highly correlated with vl_idade_media_socios_pj (ρ = 0.9810183741) | Rejected |
vl_potenc_cons_oleo_gas is highly correlated with vl_folha_coligados_gp (ρ = 0.9999548647) | Rejected |
vl_total_tancagem has constant value "nan" | Rejected |
vl_total_tancagem_grupo is highly correlated with vl_folha_coligados_gp (ρ = 1) | Rejected |
vl_total_veiculos_antt has constant value "nan" | Rejected |
vl_total_veiculos_antt_grupo has constant value "nan" | Rejected |
vl_total_veiculos_leves has 32 (1.6%) zeros | Zeros |
vl_total_veiculos_leves has 1864 (93.2%) missing values | Missing |
vl_total_veiculos_leves_grupo is highly skewed (γ1 = 44.23709965) | Skewed |
vl_total_veiculos_leves_grupo has 1834 (91.7%) zeros | Zeros |
vl_total_veiculos_pesados is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.9118619239) | Rejected |
vl_total_veiculos_pesados_grupo is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.909344757) | Rejected |
coligada_mais_antiga_ativa
Numeric
| Distinct count | 261 |
|---|---|
| Unique (%) | 13.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 219.5476553 |
|---|---|
| Minimum | 1.766666667 |
| Maximum | 636.9 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.766666667 |
|---|---|
| 5-th percentile | 39.19666667 |
| Q1 | 117.15 |
| Median | 193.9666667 |
| Q3 | 280.1833333 |
| 95-th percentile | 538.2966667 |
| Maximum | 636.9 |
| Range | 635.1333333 |
| Interquartile range | 163.0333333 |
Descriptive statistics
| Standard deviation | 145.1208491 |
|---|---|
| Coef of variation | 0.6609993121 |
| Kurtosis | 0.9429744526 |
| Mean | 219.5476553 |
| MAD | 111.2763039 |
| Skewness | 1.092550751 |
| Sum | 57741.03333 |
| Variance | 21060.06084 |
| Memory size | 31.2 KiB |
| Value | Count | Frequency (%) | |
| 247.5 | 2 | 0.1% | |
| 231.1333333 | 2 | 0.1% | |
| 243.7 | 2 | 0.1% | |
| 262.6 | 1 | 0.1% | |
| 71.56666667 | 1 | 0.1% | |
| 538.4666667 | 1 | 0.1% | |
| 113.0666667 | 1 | 0.1% | |
| 294.7666667 | 1 | 0.1% | |
| 163.2666667 | 1 | 0.1% | |
| 358.9666667 | 1 | 0.1% | |
| Other values (250) | 250 | 12.5% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.766666667 | 1 | 0.1% | |
| 5.266666667 | 1 | 0.1% | |
| 11.26666667 | 1 | 0.1% | |
| 16.93333333 | 1 | 0.1% | |
| 23.76666667 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 636.9 | 1 | 0.1% | |
| 636.6 | 1 | 0.1% | |
| 636.0333333 | 1 | 0.1% | |
| 635.8 | 1 | 0.1% | |
| 635.0666667 | 1 | 0.1% |
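Several of the statistics reported for coligada_mais_antiga_ativa are derived from the others. The sketch below recomputes them from the tabulated values as a consistency check. It assumes the usual definitions (range = max - min, IQR = Q3 - Q1, coefficient of variation = standard deviation / mean, sum = mean × non-missing count); the variable names are illustrative.

```python
# Values copied from the coligada_mais_antiga_ativa tables.
n_total, n_missing = 2000, 1737
minimum, maximum = 1.766666667, 636.9
q1, q3 = 117.15, 280.1833333
mean, std = 219.5476553, 145.1208491

n_valid = n_total - n_missing      # observations that are not missing
value_range = maximum - minimum    # "Range"
iqr = q3 - q1                      # "Interquartile range"
coef_variation = std / mean        # "Coef of variation"
variance = std ** 2                # "Variance"
total = mean * n_valid             # "Sum"

print(n_valid, value_range, iqr, coef_variation, variance, total)
```

Up to display rounding, the results agree with the reported Range, Interquartile range, Coef of variation, Variance and Sum. Every statistic here rests on only 263 non-missing observations.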
coligada_mais_antiga_baixada
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | > 99.9% |
| Missing (n) | 1999 |
| Value | Count | Frequency (%) | |
| 83.96666667 | 1 | 0.1% | |
| (Missing) | 1999 | > 99.9% |
| Max length | 17 |
|---|---|
| Mean length | 3.007 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
coligada_mais_nova_ativa
Numeric
| Distinct count | 253 |
|---|---|
| Unique (%) | 12.7% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 116.7159696 |
|---|---|
| Minimum | 1.533333333 |
| Maximum | 634.4 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.533333333 |
|---|---|
| 5-th percentile | 4.876666667 |
| Q1 | 42.3 |
| Median | 87.8 |
| Q3 | 176.35 |
| 95-th percentile | 307.98 |
| Maximum | 634.4 |
| Range | 632.8666667 |
| Interquartile range | 134.05 |
Descriptive statistics
| Standard deviation | 99.60012948 |
|---|---|
| Coef of variation | 0.8533547708 |
| Kurtosis | 2.359500176 |
| Mean | 116.7159696 |
| MAD | 79.01042664 |
| Skewness | 1.304812186 |
| Sum | 30696.3 |
| Variance | 9920.185791 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 43.7 | 3 | 0.1% | |
| 86.06666667 | 2 | 0.1% | |
| 50.8 | 2 | 0.1% | |
| 37.93333333 | 2 | 0.1% | |
| 37.3 | 2 | 0.1% | |
| 2.7 | 2 | 0.1% | |
| 96.26666667 | 2 | 0.1% | |
| 243.7 | 2 | 0.1% | |
| 4.833333333 | 2 | 0.1% | |
| 3.4 | 2 | 0.1% | |
| Other values (242) | 242 | 12.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.533333333 | 1 | 0.1% | |
| 1.766666667 | 1 | 0.1% | |
| 2.466666667 | 1 | 0.1% | |
| 2.5 | 1 | 0.1% | |
| 2.7 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 634.4 | 1 | 0.1% | |
| 407.7333333 | 1 | 0.1% | |
| 389.9666667 | 1 | 0.1% | |
| 369.3333333 | 1 | 0.1% | |
| 365.5666667 | 1 | 0.1% |
coligada_mais_nova_baixada
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | > 99.9% |
| Missing (n) | 1999 |
| Value | Count | Frequency (%) | |
| 83.96666667 | 1 | 0.1% | |
| (Missing) | 1999 | > 99.9% |
| Max length | 17 |
|---|---|
| Mean length | 3.007 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
de_faixa_faturamento_estimado
Categorical
| Distinct count | 10 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 5.9% |
| Missing (n) | 118 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 1166 | 58.3% | |
| ATE R$ 81.000,00 | 443 | 22.1% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 200 | 10.0% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 51 | 2.5% | |
| SEM INFORMACAO | 6 | 0.3% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 6 | 0.3% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 6 | 0.3% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 3 | 0.1% | |
| DE R$ 300.000.000,01 A R$ 500.000.000,00 | 1 | 0.1% | |
| (Missing) | 118 | 5.9% |
| Max length | 40 |
|---|---|
| Mean length | 26.4575 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_faixa_faturamento_estimado_grupo
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.6% |
| Missing (%) | 5.9% |
| Missing (n) | 118 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | |
| Other values (8) | 116 |
| (Missing) | 118 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 1094 | 54.7% | |
| ATE R$ 81.000,00 | 438 | 21.9% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 234 | 11.7% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 51 | 2.5% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 16 | 0.8% | |
| ACIMA DE 1 BILHAO DE REAIS | 16 | 0.8% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 12 | 0.6% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 8 | 0.4% | |
| DE R$ 100.000.000,01 A R$ 300.000.000,00 | 6 | 0.3% | |
| DE R$ 500.000.000,01 A 1 BILHAO DE REAIS | 5 | 0.2% | |
| (Missing) | 118 | 5.9% |
| Max length | 40 |
|---|---|
| Mean length | 26.682 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_indicador_telefone
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 90.2% |
| Missing (n) | 1804 |
| Value | Count | Frequency (%) | |
| BOA | 196 | 9.8% | |
| (Missing) | 1804 | 90.2% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
de_natureza_juridica
Categorical
| Distinct count | 33 |
|---|---|
| Unique (%) | 1.7% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| EMPRESARIO INDIVIDUAL | 1306 | 65.3% | |
| SOCIEDADE EMPRESARIA LIMITADA | 396 | 19.8% | |
| ASSOCIACAO PRIVADA | 115 | 5.8% | |
| EMPRESA INDIVIDUAL DE RESPONSABILIDADE LIMITADA DE NATUREZA EMPRESARIA | 68 | 3.4% | |
| ORGAO DE DIRECAO LOCAL DE PARTIDO POLITICO | 24 | 1.2% | |
| CANDIDATO A CARGO POLITICO ELETIVO | 12 | 0.6% | |
| ENTIDADE SINDICAL | 7 | 0.4% | |
| CONDOMINIO EDILICIO | 7 | 0.4% | |
| ORGANIZACAO RELIGIOSA | 7 | 0.4% | |
| SOCIEDADE SIMPLES LIMITADA | 6 | 0.3% | |
| Other values (23) | 52 | 2.6% |
| Max length | 70 |
|---|---|
| Mean length | 24.597 |
| Min length | 9 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_nivel_atividade
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 2.5% |
| Missing (n) | 51 |
| Value | Count | Frequency (%) | |
| MEDIA | 933 | 46.7% | |
| ALTA | 680 | 34.0% | |
| BAIXA | 321 | 16.1% | |
| MUITO BAIXA | 15 | 0.8% | |
| (Missing) | 51 | 2.5% |
| Max length | 11 |
|---|---|
| Mean length | 4.654 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_ramo
Categorical
| Distinct count | 31 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 749 | 37.5% | |
| SERVICOS DIVERSOS | 229 | 11.5% | |
| SERVICOS DE ALOJAMENTO/ALIMENTACAO | 133 | 6.7% | |
| INDUSTRIA DA CONSTRUCAO | 122 | 6.1% | |
| COMERCIO E REPARACAO DE VEICULOS | 115 | 5.8% | |
| SERVICOS ADMINISTRATIVOS | 102 | 5.1% | |
| BENS DE CONSUMO | 85 | 4.2% | |
| SERVICOS PROFISSIONAIS, TECNICOS E CIENTIFICOS | 78 | 3.9% | |
| COMERCIO POR ATACADO | 63 | 3.1% | |
| TRANSPORTE, ARMAZENAGEM E CORREIO | 59 | 2.9% | |
| Other values (21) | 265 | 13.2% |
| Max length | 49 |
|---|---|
| Mean length | 22.1755 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_saude_rescencia
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 3.2% |
| Missing (n) | 64 |
| Value | Count | Frequency (%) | |
| ACIMA DE 1 ANO | 1652 | 82.6% | |
| ATE 1 ANO | 149 | 7.4% | |
| SEM INFORMACAO | 135 | 6.8% | |
| (Missing) | 64 | 3.2% |
| Max length | 14 |
|---|---|
| Mean length | 13.2755 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_saude_tributaria
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 3.2% |
| Missing (n) | 64 |
| Value | Count | Frequency (%) | |
| VERDE | 679 | 34.0% | |
| AZUL | 471 | 23.5% | |
| AMARELO | 351 | 17.5% | |
| CINZA | 262 | 13.1% | |
| LARANJA | 150 | 7.5% | |
| VERMELHO | 23 | 1.1% | |
| (Missing) | 64 | 3.2% |
| Max length | 8 |
|---|---|
| Mean length | 5.236 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
dt_situacao
Categorical
| Distinct count | 1315 |
|---|---|
| Unique (%) | 65.8% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 2005-11-03 | 285 | 14.2% | |
| 2006-12-01 | 13 | 0.7% | |
| 2005-09-24 | 8 | 0.4% | |
| 2018-08-14 | 7 | 0.4% | |
| 2006-12-21 | 7 | 0.4% | |
| 2005-08-27 | 7 | 0.4% | |
| 2006-12-02 | 7 | 0.4% | |
| 2004-10-30 | 7 | 0.4% | |
| 2010-05-15 | 6 | 0.3% | |
| 1998-07-28 | 6 | 0.3% | |
| Other values (1305) | 1647 | 82.3% |
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
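The type warning for dt_situacao suggests pd.to_datetime(). A minimal sketch of that conversion, assuming the data is held in a pandas DataFrame named df (a hypothetical name) and that every value follows the YYYY-MM-DD pattern, which is consistent with the constant string length of 10 reported above:

```python
import pandas as pd

# Hypothetical three-row sample using values from the frequency table.
df = pd.DataFrame({"dt_situacao": ["2005-11-03", "2006-12-01", "2018-08-14"]})

# An explicit format makes the parse strict; with the default errors="raise",
# a malformed value stops the conversion instead of being silently coerced.
df["dt_situacao"] = pd.to_datetime(df["dt_situacao"], format="%Y-%m-%d")

print(df["dt_situacao"].dtype)
print(df["dt_situacao"].dt.year.tolist())
```

Once the column is a datetime, the high-cardinality warning matters less, because the values can be binned by year or month rather than treated as 1315 separate categories.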
empsetorcensitariofaixarendapopulacao
Numeric
| Distinct count | 1231 |
|---|---|
| Unique (%) | 61.6% |
| Missing (%) | 30.0% |
| Missing (n) | 600 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1342.215271 |
|---|---|
| Minimum | 169.28 |
| Maximum | 30861.81 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 169.28 |
|---|---|
| 5-th percentile | 449.453 |
| Q1 | 686.725 |
| Median | 943.72 |
| Q3 | 1539.025 |
| 95-th percentile | 3653.926 |
| Maximum | 30861.81 |
| Range | 30692.53 |
| Interquartile range | 852.3 |
Descriptive statistics
| Standard deviation | 1340.318374 |
|---|---|
| Coef of variation | 0.9985867411 |
| Kurtosis | 173.6273001 |
| Mean | 1342.215271 |
| MAD | 752.9107055 |
| Skewness | 9.216418036 |
| Sum | 1879101.38 |
| Variance | 1796453.343 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1549.1 | 9 | 0.4% | |
| 1086.05 | 6 | 0.3% | |
| 845.75 | 4 | 0.2% | |
| 2522.76 | 4 | 0.2% | |
| 1375.69 | 4 | 0.2% | |
| 1116.52 | 4 | 0.2% | |
| 786.74 | 3 | 0.1% | |
| 452.13 | 3 | 0.1% | |
| 1939.11 | 3 | 0.1% | |
| 1519.63 | 3 | 0.1% | |
| Other values (1220) | 1357 | 67.8% | |
| (Missing) | 600 | 30.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 169.28 | 1 | 0.1% | |
| 205.66 | 1 | 0.1% | |
| 247.88 | 1 | 0.1% | |
| 255.93 | 1 | 0.1% | |
| 262.44 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 30861.81 | 1 | 0.1% | |
| 12512.93 | 1 | 0.1% | |
| 10039.2 | 1 | 0.1% | |
| 7716.18 | 1 | 0.1% | |
| 6641.43 | 1 | 0.1% |
faturamento_est_coligados
Numeric
| Distinct count | 135 |
|---|---|
| Unique (%) | 6.8% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 325097449.3 |
|---|---|
| Minimum | 50000 |
| Maximum | 2.775903576e+10 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 50000 |
|---|---|
| 5-th percentile | 186684.7141 |
| Q1 | 210000 |
| Median | 680011.2109 |
| Q3 | 2515005.594 |
| 95-th percentile | 151386514.8 |
| Maximum | 2.775903576e+10 |
| Range | 2.775898576e+10 |
| Interquartile range | 2305005.594 |
Descriptive statistics
| Standard deviation | 2619159438 |
|---|---|
| Coef of variation | 8.056536414 |
| Kurtosis | 93.97533198 |
| Mean | 325097449.3 |
| MAD | 613818086.9 |
| Skewness | 9.528882491 |
| Sum | 8.517553171e+10 |
| Variance | 6.859996162e+18 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 210000 | 68 | 3.4% | |
| 420000 | 14 | 0.7% | |
| 930000 | 11 | 0.5% | |
| 630000 | 7 | 0.4% | |
| 185457.5938 | 6 | 0.3% | |
| 370915.1875 | 5 | 0.2% | |
| 50000 | 5 | 0.2% | |
| 989107.1875 | 3 | 0.1% | |
| 123638.3984 | 3 | 0.1% | |
| 1260000 | 3 | 0.1% | |
| Other values (124) | 137 | 6.9% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 50000 | 5 | 0.2% | |
| 123638.3984 | 3 | 0.1% | |
| 185457.5938 | 6 | 0.3% | |
| 210000 | 68 | 3.4% | |
| 247276.7969 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2.775903576e+10 | 1 | 0.1% | |
| 2.753563853e+10 | 1 | 0.1% | |
| 1.508602154e+10 | 1 | 0.1% | |
| 7257133548 | 1 | 0.1% | |
| 2143498799 | 1 | 0.1% |
faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9509546939 |
|---|
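The rejection rule can be approximated as follows. This is a sketch, not the profiler's own code: find_correlated is a hypothetical helper, and the 0.9 threshold is an assumption inferred from the fact that every rejected variable in the warnings list has ρ above 0.90. It compares the signed coefficient, since the report does not show whether strong negative correlations are also rejected.

```python
import pandas as pd


def find_correlated(df: pd.DataFrame, threshold: float = 0.9) -> dict:
    """Map each column to the first earlier column it is highly correlated with.

    Hypothetical helper; the 0.9 default threshold is an assumption.
    """
    # DataFrame.corr uses pairwise-complete observations, so each coefficient
    # is based only on rows where both columns are non-missing.
    corr = df.corr(method="pearson")
    columns = list(corr.columns)
    rejected = {}
    for i, col in enumerate(columns):
        for earlier in columns[:i]:
            # Comparisons with NaN are False, so undefined correlations are skipped.
            if corr.loc[col, earlier] > threshold:
                rejected[col] = earlier
                break
    return rejected


# Toy data: "b" is almost a multiple of "a"; "c" is only weakly related to both.
toy = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.0, 4.0, 6.0, 8.0, 10.5],
    "c": [5.0, 1.0, 4.0, 2.0, 3.0],
})
rejected = find_correlated(toy)
print(rejected)
```

Because the coefficient uses only rows where both variables are present, correlations between sparsely populated columns may rest on very few observations. faturamento_est_coligados, for example, has 262 non-missing values out of 2000.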
fl_antt
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| False | 1983 | 99.2% | |
| True | 6 | 0.3% | |
| (Missing) | 11 | 0.5% |
fl_email
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| False | 1071 | 53.5% | |
| True | 929 | 46.5% |
fl_epp
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_ltda
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| False | 1997 | 99.9% | |
| True | 3 | 0.1% |
fl_matriz
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| True | 1893 | 94.7% | |
| False | 107 | 5.3% |
fl_me
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| False | 1996 | 99.8% | |
| True | 4 | 0.2% |
fl_mei
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| False | 1339 | 67.0% | |
| True | 661 | 33.1% |
fl_optante_simei
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 17.3% |
| Missing (n) | 346 |
| Value | Count | Frequency (%) | |
| False | 1233 | 61.7% | |
| True | 421 | 21.1% | |
| (Missing) | 346 | 17.3% |
fl_optante_simples
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 17.3% |
| Missing (n) | 346 |
| Value | Count | Frequency (%) | |
| True | 913 | 45.6% | |
| False | 741 | 37.0% | |
| (Missing) | 346 | 17.3% |
fl_passivel_iss
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| True | 1141 | 57.0% | |
| False | 848 | 42.4% | |
| (Missing) | 11 | 0.5% |
fl_rm
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| NAO | 1008 | 50.4% | |
| SIM | 992 | 49.6% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
fl_sa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| False | 1964 | 98.2% | |
| True | 36 | 1.8% |
fl_simples_irregular
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| False | 1987 | 99.4% | |
| True | 2 | 0.1% | |
| (Missing) | 11 | 0.5% |
fl_spa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| False | 1989 | 99.5% | |
| (Missing) | 11 | 0.5% |
fl_st_especial
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|---|
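A minimal sketch of how constant columns such as this one could be detected and dropped with pandas. The helper `constant_columns` and the toy values are hypothetical. Counting NaN as a value (`dropna=False`) is an assumption, chosen so that an all-missing column, whose constant value would be reported as nan, is also caught.

```python
import pandas as pd

def constant_columns(df):
    """Return the columns with at most one distinct value, counting NaN as a value."""
    distinct = df.nunique(dropna=False)
    return distinct[distinct <= 1].index.tolist()

# Toy frame: the column names mirror the report, the values are made up.
toy = pd.DataFrame({
    "fl_st_especial": [False, False, False],
    "fl_telefone": [True, False, True],
    "grau_instrucao_macro_desconhecido": [float("nan")] * 3,
})

to_drop = constant_columns(toy)
reduced = toy.drop(columns=to_drop)
print(to_drop)
print(list(reduced.columns))
```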
fl_telefone
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| True | 1459 | 73.0% | |
| False | 541 | 27.1% |
fl_veiculo
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| False | 1853 | 92.7% | |
| True | 136 | 6.8% | |
| (Missing) | 11 | 0.5% |
grau_instrucao_macro_analfabeto
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.7% |
| Missing (n) | 1994 |
| Value | Count | Frequency (%) | |
| 1 | 5 | 0.2% | |
| 3 | 1 | 0.1% | |
| (Missing) | 1994 | 99.7% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
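This column is typed as Categorical although its values are digits, and every value has length 3, which would be consistent with strings such as "1.0" and "nan". A hedged sketch of converting such a column back to numbers, assuming that string encoding (the raw values below are hypothetical, not taken from the dataset):

```python
import pandas as pd

# Hypothetical raw values: string-encoded numbers, with "nan" marking missing entries.
raw = pd.Series(["1.0", "nan", "3.0", "nan", "1.0"],
                name="grau_instrucao_macro_analfabeto")

# errors="coerce" turns anything that cannot be parsed into NaN instead of raising.
numeric = pd.to_numeric(raw, errors="coerce")

print(numeric.dtype)
print(int(numeric.notna().sum()), "usable observations")
```

With 99.7% of the values missing, converting the type does not make the column informative; it only makes the handful of observed values usable as numbers.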
grau_instrucao_macro_desconhecido
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|---|
grau_instrucao_macro_escolaridade_fundamental
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9980369909 |
|---|---|
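The rejection rule above can be sketched as follows. This is a generic illustration, not the report generator's own code: it assumes Pearson correlation and a threshold of 0.9 (every rejected pair listed in this report has a coefficient above 0.9), and `highly_correlated` and the toy columns `a`, `b`, `c` are hypothetical names.

```python
import numpy as np
import pandas as pd

def highly_correlated(df, threshold=0.9):
    """Map each column to the first earlier column whose absolute
    Pearson correlation with it exceeds the threshold."""
    corr = df.corr(method="pearson").abs()
    cols = list(corr.columns)
    rejected = {}
    for j, col in enumerate(cols):
        for earlier in cols[:j]:
            if corr.loc[earlier, col] > threshold:
                rejected[col] = earlier
                break
    return rejected

rng = np.random.default_rng(0)
base = rng.normal(size=200)
toy = pd.DataFrame({
    "a": base,
    "b": base * 2 + rng.normal(scale=0.01, size=200),  # near-duplicate of a
    "c": rng.normal(size=200),                          # independent of a and b
})

flagged = highly_correlated(toy)
print(flagged)
```

Because each column is compared with every earlier column, including ones already flagged, chains of rejections can form, as they do in this report.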
grau_instrucao_macro_escolaridade_media
Numeric
| Distinct count | 37 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 84.7% |
| Missing (n) | 1694 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8.637254902 |
|---|---|
| Minimum | 1 |
| Maximum | 523 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 3 |
| Q3 | 6 |
| 95-th percentile | 24.5 |
| Maximum | 523 |
| Range | 522 |
| Interquartile range | 5 |
Descriptive statistics
| Standard deviation | 33.48847295 |
|---|---|
| Coef of variation | 3.877212532 |
| Kurtosis | 185.9186412 |
| Mean | 8.637254902 |
| MAD | 9.679418172 |
| Skewness | 12.63630163 |
| Sum | 2643 |
| Variance | 1121.477821 |
| Memory size | 31.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 89 | 4.5% | |
| 2 | 56 | 2.8% | |
| 3 | 34 | 1.7% | |
| 4 | 25 | 1.2% | |
| 5 | 17 | 0.9% | |
| 6 | 17 | 0.9% | |
| 9 | 7 | 0.4% | |
| 7 | 7 | 0.4% | |
| 8 | 6 | 0.3% | |
| 10 | 6 | 0.3% | |
| Other values (26) | 42 | 2.1% | |
| (Missing) | 1694 | 84.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 89 | 4.5% | |
| 2 | 56 | 2.8% | |
| 3 | 34 | 1.7% | |
| 4 | 25 | 1.2% | |
| 5 | 17 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 523 | 1 | 0.1% | |
| 177 | 1 | 0.1% | |
| 120 | 1 | 0.1% | |
| 94 | 1 | 0.1% | |
| 82 | 1 | 0.1% |
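The derived figures in the tables above follow from a few base quantities. A short check, using only numbers copied from this variable's tables (the variable names are illustrative):

```python
import math

# Figures copied from the grau_instrucao_macro_escolaridade_media tables.
n_total, n_missing = 2000, 1694
total_sum = 2643
std = 33.48847295
q1, q3 = 1, 6
minimum, maximum = 1, 523

n_valid = n_total - n_missing     # non-missing observations
mean = total_sum / n_valid        # Sum divided by the non-missing count
cv = std / mean                   # coefficient of variation
variance = std ** 2
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum

print(n_valid, mean, cv, variance, iqr, value_range)
```

The mean is therefore taken over the 306 non-missing rows only, not over all 2000 observations.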
grau_instrucao_macro_escolaridade_superior
Highly correlated
This variable is highly correlated with grau_instrucao_macro_escolaridade_media and should be ignored for analysis
| Correlation | 0.9178829036 |
|---|---|
id
Categorical, Unique
| First 5 values |
|---|
| 00123b6e449556823ba4aac6dbb35b44f60557c511566f... |
| 0032e3e6a776cbf4d36efa963b4eda224ddba8af284117... |
| 0036afe9a1be13d5389ab0d8f8cd2217c8c6076345298c... |
| 008fd7836462aaecf8b8d335153ada057577156875fae4... |
| 00c30fb1779762241dd1ca7d1baab00c24b2f87294e573... |
| Last 5 values |
|---|
| ffba50aa6f4af7271d8f9738018b0f0329c89b05fe5971... |
| ffc04bac625e91c7ebad8dc98943264661aca3a913b5f6... |
| ffd37fc0ee78a4555d0c18e10b3779e54581142875f0ad... |
| ffd81231de150c50400d79d1740ca8108eda5b339beb85... |
| ffed1e47eaf7b3444605cd7cb91bf9ef7cf3bbe9f7f730... |
First 5 values
| Value | Count | Frequency (%) | |
| 00123b6e449556823ba4aac6dbb35b44f60557c511566f838dc889b75c6f9af1 | 1 | 0.1% | |
| 0032e3e6a776cbf4d36efa963b4eda224ddba8af284117273bbd7a2a9d374f96 | 1 | 0.1% | |
| 0036afe9a1be13d5389ab0d8f8cd2217c8c6076345298cb451f3634f44294ae0 | 1 | 0.1% | |
| 008fd7836462aaecf8b8d335153ada057577156875fae4933a9fc16a2a44e0d0 | 1 | 0.1% | |
| 00c30fb1779762241dd1ca7d1baab00c24b2f87294e573415ccfa23fda43c270 | 1 | 0.1% |
Last 5 values
| Value | Count | Frequency (%) | |
| ffed1e47eaf7b3444605cd7cb91bf9ef7cf3bbe9f7f73092c10d21a1d454d1fd | 1 | 0.1% | |
| ffd81231de150c50400d79d1740ca8108eda5b339beb850646cdd9424bae405e | 1 | 0.1% | |
| ffd37fc0ee78a4555d0c18e10b3779e54581142875f0ad31f9ba0406fd5a9b2d | 1 | 0.1% | |
| ffc04bac625e91c7ebad8dc98943264661aca3a913b5f634c2ae69a947772e13 | 1 | 0.1% | |
| ffba50aa6f4af7271d8f9738018b0f0329c89b05fe597100f833a07065a2c417 | 1 | 0.1% |
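The full identifiers listed above are 64 lowercase hexadecimal characters, which is the shape of a SHA-256 hex digest. Whether they really are SHA-256 digests is an assumption. A small sketch of a format check under that assumption (`ID_PATTERN` and `is_valid_id` are hypothetical names):

```python
import re

# Assumed format: exactly 64 lowercase hexadecimal characters.
ID_PATTERN = re.compile(r"[0-9a-f]{64}")

def is_valid_id(value):
    """Return True when the whole string matches the assumed id format."""
    return ID_PATTERN.fullmatch(value) is not None

# First id from the table above.
sample = "00123b6e449556823ba4aac6dbb35b44f60557c511566f838dc889b75c6f9af1"
print(is_valid_id(sample))
```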
idade_acima_de_58
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 1 |
|---|---|
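A correlation of exactly 1 with grau_instrucao_macro_analfabeto deserves caution: that column has only 6 non-missing values out of 2000, so the coefficient rests on at most 6 pairs. The sketch below uses made-up data and assumes pairwise-complete Pearson correlation, as computed by pandas' `Series.corr`. It shows how a perfect correlation can appear from very few overlapping rows, and how `min_periods` guards against it.

```python
import numpy as np
import pandas as pd

# Made-up data: x is almost entirely missing, y is fully observed.
x = pd.Series([1.0, np.nan, np.nan, 3.0, np.nan, np.nan])
y = pd.Series([2.0, 9.0, 4.0, 5.0, 7.0, 1.0])

# Only rows 0 and 3 form complete pairs, and two points always lie on a line.
r_unguarded = x.corr(y)

# With min_periods, the result is NaN when too few complete pairs exist.
r_guarded = x.corr(y, min_periods=5)

print(r_unguarded, r_guarded)
```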
idade_ate_18
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 99.5% |
| Missing (n) | 1990 |
| Value | Count | Frequency (%) | |
| 1 | 7 | 0.4% | |
| 3 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1990 | 99.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
idade_de_19_a_23
Numeric
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 94.0% |
| Missing (n) | 1879 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.462809917 |
|---|---|
| Minimum | 1 |
| Maximum | 43 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 43 |
| Range | 42 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 4.44792334 |
|---|---|
| Coef of variation | 1.806035987 |
| Kurtosis | 58.70447671 |
| Mean | 2.462809917 |
| MAD | 1.958745987 |
| Skewness | 6.926566897 |
| Sum | 298 |
| Variance | 19.78402204 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 19 | 0.9% | |
| 3 | 10 | 0.5% | |
| 4 | 6 | 0.3% | |
| 5 | 4 | 0.2% | |
| 11 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 6 | 1 | 0.1% | |
| Other values (2) | 2 | 0.1% | |
| (Missing) | 1879 | 94.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 19 | 0.9% | |
| 3 | 10 | 0.5% | |
| 4 | 6 | 0.3% | |
| 5 | 4 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 43 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 11 | 1 | 0.1% |
idade_de_24_a_28
Highly correlated
This variable is highly correlated with idade_de_19_a_23 and should be ignored for analysis
| Correlation | 0.9086682511 |
|---|---|
idade_de_29_a_33
Highly correlated
This variable is highly correlated with grau_instrucao_macro_escolaridade_superior and should be ignored for analysis
| Correlation | 0.9749169386 |
|---|---|
idade_de_34_a_38
Highly correlated
This variable is highly correlated with idade_de_29_a_33 and should be ignored for analysis
| Correlation | 0.994053508 |
|---|---|
idade_de_39_a_43
Highly correlated
This variable is highly correlated with idade_de_34_a_38 and should be ignored for analysis
| Correlation | 0.9834868533 |
|---|---|
idade_de_44_a_48
Highly correlated
This variable is highly correlated with idade_de_39_a_43 and should be ignored for analysis
| Correlation | 0.9491747935 |
|---|---|
idade_de_49_a_53
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9891584832 |
|---|---|
idade_de_54_a_58
Highly correlated
This variable is highly correlated with idade_de_49_a_53 and should be ignored for analysis
| Correlation | 0.9617506252 |
|---|---|
idade_emp_cat
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 1 a 5 | 609 | 30.4% | |
| 5 a 10 | 523 | 26.2% | |
| > 20 | 329 | 16.4% | |
| 10 a 15 | 196 | 9.8% | |
| <= 1 | 195 | 9.8% | |
| 15 a 20 | 148 | 7.4% |
| Max length | 7 |
|---|---|
| Mean length | 5.3435 |
| Min length | 4 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
idade_empresa_anos
Numeric
| Distinct count | 1669 |
|---|---|
| Unique (%) | 83.5% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.945464384 |
|---|---|
| Minimum | 0.0301369863 |
| Maximum | 52.11506849 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.0301369863 |
|---|---|
| 5-th percentile | 0.4712328767 |
| Q1 | 2.700684932 |
| Median | 6.646575342 |
| Q3 | 14.39109589 |
| 95-th percentile | 30.71863014 |
| Maximum | 52.11506849 |
| Range | 52.08493151 |
| Interquartile range | 11.69041096 |
Descriptive statistics
| Standard deviation | 9.690644903 |
|---|---|
| Coef of variation | 0.9743783226 |
| Kurtosis | 1.386651358 |
| Mean | 9.945464384 |
| MAD | 7.607123093 |
| Skewness | 1.376674689 |
| Sum | 19890.92877 |
| Variance | 93.90859864 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0.2054794521 | 7 | 0.4% | |
| 0.2082191781 | 5 | 0.2% | |
| 0.6054794521 | 4 | 0.2% | |
| 1.254794521 | 4 | 0.2% | |
| 2.394520548 | 4 | 0.2% | |
| 3.460273973 | 4 | 0.2% | |
| 1.02739726 | 4 | 0.2% | |
| 2.216438356 | 4 | 0.2% | |
| 3.060273973 | 4 | 0.2% | |
| 5.194520548 | 4 | 0.2% | |
| Other values (1659) | 1956 | 97.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.0301369863 | 2 | 0.1% | |
| 0.04657534247 | 1 | 0.1% | |
| 0.04931506849 | 1 | 0.1% | |
| 0.05205479452 | 1 | 0.1% | |
| 0.05479452055 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 52.11506849 | 1 | 0.1% | |
| 51.15342466 | 1 | 0.1% | |
| 49.43835616 | 1 | 0.1% | |
| 48.27945205 | 1 | 0.1% | |
| 47.3260274 | 1 | 0.1% |
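The labels of idade_emp_cat read like bins of idade_empresa_anos, and the ages themselves look like day counts divided by 365 (the minimum, 0.0301369863, equals 11 / 365). A sketch of that binning with `pandas.cut`, assuming right-closed bins with the edges implied by the labels. The edges are inferred, not stated in the report, although the reported quartiles fall in the bins that the category frequencies predict.

```python
import numpy as np
import pandas as pd

# Assumed bin edges, inferred from the idade_emp_cat labels.
edges = [0, 1, 5, 10, 15, 20, np.inf]
labels = ["<= 1", "1 a 5", "5 a 10", "10 a 15", "15 a 20", "> 20"]

# A few ages in years taken from the tables above.
ages = pd.Series([11 / 365, 0.2054794521, 6.646575342, 14.39109589, 52.11506849])

# right=True closes each bin on the right, so exactly 1 year falls in "<= 1".
categories = pd.cut(ages, bins=edges, labels=labels, right=True)
print(categories.tolist())
```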
idade_maxima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_antiga_ativa and should be ignored for analysis
| Correlation | 0.9997565556 |
|---|---|
idade_maxima_socios
Numeric
| Distinct count | 73 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 33.0% |
| Missing (n) | 659 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 44.68680089 |
|---|---|
| Minimum | 18 |
| Maximum | 96 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 34 |
| Median | 43 |
| Q3 | 54 |
| 95-th percentile | 70 |
| Maximum | 96 |
| Range | 78 |
| Interquartile range | 20 |
Descriptive statistics
| Standard deviation | 13.94082527 |
|---|---|
| Coef of variation | 0.3119674041 |
| Kurtosis | -0.07141409063 |
| Mean | 44.68680089 |
| MAD | 11.35869088 |
| Skewness | 0.5431259797 |
| Sum | 59925 |
| Variance | 194.3466092 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 41 | 46 | 2.3% | |
| 35 | 43 | 2.1% | |
| 30 | 40 | 2.0% | |
| 32 | 40 | 2.0% | |
| 40 | 39 | 1.9% | |
| 45 | 38 | 1.9% | |
| 47 | 38 | 1.9% | |
| 48 | 37 | 1.8% | |
| 39 | 37 | 1.8% | |
| 33 | 37 | 1.8% | |
| Other values (62) | 946 | 47.3% | |
| (Missing) | 659 | 33.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 18 | 1 | 0.1% | |
| 19 | 4 | 0.2% | |
| 20 | 7 | 0.4% | |
| 21 | 7 | 0.4% | |
| 22 | 14 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 96 | 1 | 0.1% | |
| 90 | 1 | 0.1% | |
| 87 | 2 | 0.1% | |
| 86 | 2 | 0.1% | |
| 85 | 1 | 0.1% |
idade_media_coligadas
Numeric
| Distinct count | 262 |
|---|---|
| Unique (%) | 13.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 159.2109782 |
|---|---|
| Minimum | 1.766666667 |
| Maximum | 634.4 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.766666667 |
|---|---|
| 5-th percentile | 37.77666667 |
| Q1 | 88.3 |
| Median | 148.6333333 |
| Q3 | 213.65 |
| 95-th percentile | 333.7066667 |
| Maximum | 634.4 |
| Range | 632.6333333 |
| Interquartile range | 125.35 |
Descriptive statistics
| Standard deviation | 92.85948753 |
|---|---|
| Coef of variation | 0.5832480184 |
| Kurtosis | 2.209191276 |
| Mean | 159.2109782 |
| MAD | 72.52905633 |
| Skewness | 1.064459267 |
| Sum | 41872.48726 |
| Variance | 8622.884424 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 243.7 | 2 | 0.1% | |
| 87.64444444 | 2 | 0.1% | |
| 262.6 | 1 | 0.1% | |
| 85.85 | 1 | 0.1% | |
| 137.8833333 | 1 | 0.1% | |
| 23.76666667 | 1 | 0.1% | |
| 154.15 | 1 | 0.1% | |
| 121.4111111 | 1 | 0.1% | |
| 47.23333333 | 1 | 0.1% | |
| 16.93333333 | 1 | 0.1% | |
| Other values (251) | 251 | 12.6% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.766666667 | 1 | 0.1% | |
| 5.266666667 | 1 | 0.1% | |
| 11.26666667 | 1 | 0.1% | |
| 16.93333333 | 1 | 0.1% | |
| 23.76666667 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 634.4 | 1 | 0.1% | |
| 445.125 | 1 | 0.1% | |
| 407.7333333 | 1 | 0.1% | |
| 395.3333333 | 1 | 0.1% | |
| 392.85 | 1 | 0.1% |
idade_media_coligadas_ativas
Highly correlated
This variable is highly correlated with idade_media_coligadas and should be ignored for analysis
| Correlation | 0.9996384056 |
|---|---|
idade_media_coligadas_baixadas
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | > 99.9% |
| Missing (n) | 1999 |
| Value | Count | Frequency (%) | |
| 83.96666667 | 1 | 0.1% | |
| (Missing) | 1999 | > 99.9% |
| Max length | 17 |
|---|---|
| Mean length | 3.007 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
idade_media_socios
Highly correlated
This variable is highly correlated with idade_maxima_socios and should be ignored for analysis
| Correlation | 0.9554366317 |
|---|---|
idade_minima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_nova_ativa and should be ignored for analysis
| Correlation | 1 |
|---|---|
idade_minima_socios
Highly correlated
This variable is highly correlated with idade_media_socios and should be ignored for analysis
| Correlation | 0.9505781936 |
|---|---|
max_faturamento_est_coligados
Highly correlated
This variable is highly correlated with faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.9359933557 |
|---|---|
max_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with max_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9599434338 |
|---|---|
max_filiais_coligados
Numeric
| Distinct count | 22 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 96.0% |
| Missing (n) | 1919 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 12.38271605 |
|---|---|
| Minimum | 1 |
| Maximum | 349 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 5 |
| 95-th percentile | 46 |
| Maximum | 349 |
| Range | 348 |
| Interquartile range | 4 |
Descriptive statistics
| Standard deviation | 42.61999762 |
|---|---|
| Coef of variation | 3.441894125 |
| Kurtosis | 50.59629216 |
| Mean | 12.38271605 |
| MAD | 17.06233806 |
| Skewness | 6.727505967 |
| Sum | 1003 |
| Variance | 1816.464198 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 2 | 16 | 0.8% | |
| 3 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 8 | 2 | 0.1% | |
| 6 | 2 | 0.1% | |
| 21 | 2 | 0.1% | |
| 4 | 2 | 0.1% | |
| 37 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| Other values (11) | 11 | 0.5% | |
| (Missing) | 1919 | 96.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 2 | 16 | 0.8% | |
| 3 | 4 | 0.2% | |
| 4 | 2 | 0.1% | |
| 5 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 349 | 1 | 0.1% | |
| 144 | 1 | 0.1% | |
| 67 | 1 | 0.1% | |
| 49 | 1 | 0.1% | |
| 46 | 1 | 0.1% |
max_funcionarios_coligados_gp
Numeric
| Distinct count | 61 |
|---|---|
| Unique (%) | 3.0% |
| Missing (%) | 91.5% |
| Missing (n) | 1830 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 277.2647059 |
|---|---|
| Minimum | 0 |
| Maximum | 13234 |
| Zeros (%) | 1.4% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| Median | 7 |
| Q3 | 21 |
| 95-th percentile | 917.5 |
| Maximum | 13234 |
| Range | 13234 |
| Interquartile range | 19 |
Descriptive statistics
| Standard deviation | 1358.472975 |
|---|---|
| Coef of variation | 4.899552471 |
| Kurtosis | 58.60802041 |
| Mean | 277.2647059 |
| MAD | 469.5352941 |
| Skewness | 7.258593967 |
| Sum | 47135 |
| Variance | 1845448.823 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 1 | 14 | 0.7% | |
| 8 | 10 | 0.5% | |
| 3 | 9 | 0.4% | |
| 6 | 9 | 0.4% | |
| 2 | 8 | 0.4% | |
| 4 | 7 | 0.4% | |
| 5 | 7 | 0.4% | |
| 7 | 5 | 0.2% | |
| 12 | 4 | 0.2% | |
| Other values (50) | 69 | 3.5% | |
| (Missing) | 1830 | 91.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 1 | 14 | 0.7% | |
| 2 | 8 | 0.4% | |
| 3 | 9 | 0.4% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 13234 | 1 | 0.1% | |
| 7646 | 1 | 0.1% | |
| 7612 | 1 | 0.1% | |
| 3719 | 1 | 0.1% | |
| 3278 | 1 | 0.1% |
max_meses_servicos
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9664890124 |
|---|---|
max_meses_servicos_all
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9664890124 |
|---|---|
max_vl_folha_coligados
Numeric
| Distinct count | 80 |
|---|---|
| Unique (%) | 4.0% |
| Missing (%) | 92.9% |
| Missing (n) | 1858 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 17759814.38 |
|---|---|
| Minimum | 20606.4 |
| Maximum | 1002336500 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 20606.4 |
|---|---|
| 5-th percentile | 61819.2 |
| Q1 | 185457.6 |
| Median | 556372.8 |
| Q3 | 1375477.2 |
| 95-th percentile | 49336874 |
| Maximum | 1002336500 |
| Range | 1002315894 |
| Interquartile range | 1190019.6 |
Descriptive statistics
| Standard deviation | 94405468.23 |
|---|---|
| Coef of variation | 5.315678767 |
| Kurtosis | 87.64999342 |
| Mean | 17759814.38 |
| MAD | 29605044.23 |
| Skewness | 8.890949332 |
| Sum | 2521893642 |
| Variance | 8.912392432e+15 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 61819.2 | 15 | 0.8% | |
| 247276.8 | 7 | 0.4% | |
| 123638.4 | 6 | 0.3% | |
| 185457.6 | 6 | 0.3% | |
| 494553.6 | 4 | 0.2% | |
| 329702.4 | 3 | 0.1% | |
| 432734.4 | 3 | 0.1% | |
| 618192 | 3 | 0.1% | |
| 309096 | 3 | 0.1% | |
| 206064 | 3 | 0.1% | |
| Other values (69) | 89 | 4.5% | |
| (Missing) | 1858 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 20606.4 | 2 | 0.1% | |
| 41212.8 | 1 | 0.1% | |
| 61819.2 | 15 | 0.8% | |
| 82425.6 | 2 | 0.1% | |
| 103032 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1002336500 | 1 | 0.1% | |
| 442419400 | 1 | 0.1% | |
| 194709870 | 1 | 0.1% | |
| 134910100 | 1 | 0.1% | |
| 130871250 | 1 | 0.1% |
max_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9048394021 |
|---|---|
media_faturamento_est_coligados
Numeric
| Distinct count | 130 |
|---|---|
| Unique (%) | 6.5% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 13597728.61 |
|---|---|
| Minimum | 50000 |
| Maximum | 1321858846 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 50000 |
|---|---|
| 5-th percentile | 182433.1797 |
| Q1 | 210000 |
| Median | 310976.5714 |
| Q3 | 930000 |
| 95-th percentile | 14443904.07 |
| Maximum | 1321858846 |
| Range | 1321808846 |
| Interquartile range | 720000 |
Descriptive statistics
| Standard deviation | 97682086.91 |
|---|---|
| Coef of variation | 7.183706168 |
| Kurtosis | 129.9251177 |
| Mean | 13597728.61 |
| MAD | 24051830 |
| Skewness | 10.66475957 |
| Sum | 3562604897 |
| Variance | 9.541790104e+15 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 210000 | 95 | 4.8% | |
| 930000 | 12 | 0.6% | |
| 185457.5938 | 6 | 0.3% | |
| 370915.1875 | 5 | 0.2% | |
| 50000 | 5 | 0.2% | |
| 989107.1875 | 3 | 0.1% | |
| 166819.1992 | 2 | 0.1% | |
| 228638.3984 | 2 | 0.1% | |
| 182274 | 2 | 0.1% | |
| 123638.3984 | 2 | 0.1% | |
| Other values (119) | 128 | 6.4% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 50000 | 5 | 0.2% | |
| 61819.19922 | 1 | 0.1% | |
| 123638.3984 | 2 | 0.1% | |
| 153737.6003 | 1 | 0.1% | |
| 166819.1992 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1321858846 | 1 | 0.1% | |
| 518366682 | 1 | 0.1% | |
| 502867384.6 | 1 | 0.1% | |
| 441742236 | 1 | 0.1% | |
| 235347337.9 | 1 | 0.1% |
media_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9503777089 |
|---|---|
media_filiais_coligados
Numeric
| Distinct count | 30 |
|---|---|
| Unique (%) | 1.5% |
| Missing (%) | 96.0% |
| Missing (n) | 1919 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4.430688276 |
|---|---|
| Minimum | 1 |
| Maximum | 67 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1.5 |
| Q3 | 3 |
| 95-th percentile | 14.5 |
| Maximum | 67 |
| Range | 66 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 9.535549907 |
|---|---|
| Coef of variation | 2.152159961 |
| Kurtosis | 28.7355447 |
| Mean | 4.430688276 |
| MAD | 4.605814278 |
| Skewness | 5.072445182 |
| Sum | 358.8857504 |
| Variance | 90.92671204 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 2 | 11 | 0.5% | |
| 1.5 | 4 | 0.2% | |
| 3 | 3 | 0.1% | |
| 4.5 | 2 | 0.1% | |
| 1.666666667 | 1 | 0.1% | |
| 14.5 | 1 | 0.1% | |
| 2.5 | 1 | 0.1% | |
| 4 | 1 | 0.1% | |
| 8.5 | 1 | 0.1% | |
| Other values (19) | 19 | 0.9% | |
| (Missing) | 1919 | 96.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 1.333333333 | 1 | 0.1% | |
| 1.5 | 4 | 0.2% | |
| 1.666666667 | 1 | 0.1% | |
| 2 | 11 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 67 | 1 | 0.1% | |
| 49.4 | 1 | 0.1% | |
| 20.66666667 | 1 | 0.1% | |
| 18 | 1 | 0.1% | |
| 14.5 | 1 | 0.1% |
media_funcionarios_coligados_gp
Numeric
| Distinct count | 71 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 91.5% |
| Missing (n) | 1830 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 53.77076285 |
|---|---|
| Minimum | 0 |
| Maximum | 2162.357143 |
| Zeros (%) | 1.4% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.25 |
| Median | 6 |
| Q3 | 16 |
| 95-th percentile | 195.88125 |
| Maximum | 2162.357143 |
| Range | 2162.357143 |
| Interquartile range | 14.75 |
Descriptive statistics
| Standard deviation | 213.8766555 |
|---|---|
| Coef of variation | 3.977564092 |
| Kurtosis | 63.0354861 |
| Mean | 53.77076285 |
| MAD | 80.96236244 |
| Skewness | 7.368189486 |
| Sum | 9141.029685 |
| Variance | 45743.22378 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 1 | 14 | 0.7% | |
| 6 | 10 | 0.5% | |
| 2 | 10 | 0.5% | |
| 3 | 9 | 0.4% | |
| 8 | 8 | 0.4% | |
| 5 | 7 | 0.4% | |
| 4 | 6 | 0.3% | |
| 21 | 6 | 0.3% | |
| 7 | 5 | 0.2% | |
| Other values (60) | 67 | 3.4% | |
| (Missing) | 1830 | 91.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 0.6666666667 | 1 | 0.1% | |
| 1 | 14 | 0.7% | |
| 2 | 10 | 0.5% | |
| 3 | 9 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2162.357143 | 1 | 0.1% | |
| 1254.75 | 1 | 0.1% | |
| 931 | 1 | 0.1% | |
| 546 | 1 | 0.1% | |
| 479.8333333 | 1 | 0.1% |
media_meses_servicos
Highly correlated
This variable is highly correlated with max_meses_servicos and should be ignored for analysis
| Correlation | 0.9868452786 |
|---|---|
media_meses_servicos_all
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9765604493 |
|---|---|
media_vl_folha_coligados
Numeric
| Distinct count | 96 |
|---|---|
| Unique (%) | 4.8% |
| Missing (%) | 92.9% |
| Missing (n) | 1858 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4118832.88 |
|---|---|
| Minimum | 20606.40039 |
| Maximum | 134910096 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 20606.40039 |
|---|---|
| 5-th percentile | 61819.19922 |
| Q1 | 185457.5938 |
| Median | 443037.6094 |
| Q3 | 1133352 |
| 95-th percentile | 11696119.15 |
| Maximum | 134910096 |
| Range | 134889489.6 |
| Interquartile range | 947894.4062 |
Descriptive statistics
| Standard deviation | 16323453.39 |
|---|---|
| Coef of variation | 3.963125931 |
| Kurtosis | 45.65863081 |
| Mean | 4118832.88 |
| MAD | 6159844.549 |
| Skewness | 6.523508433 |
| Sum | 584874269 |
| Variance | 2.664551306e+14 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 61819.19922 | 15 | 0.8% | |
| 185457.5938 | 6 | 0.3% | |
| 123638.3984 | 5 | 0.2% | |
| 247276.7969 | 4 | 0.2% | |
| 329702.4062 | 3 | 0.1% | |
| 309096 | 3 | 0.1% | |
| 206064 | 3 | 0.1% | |
| 494553.5938 | 3 | 0.1% | |
| 370915.1875 | 3 | 0.1% | |
| 1050926.375 | 2 | 0.1% | |
| Other values (85) | 95 | 4.8% | |
| (Missing) | 1858 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 20606.40039 | 2 | 0.1% | |
| 41212.80078 | 1 | 0.1% | |
| 61819.19922 | 15 | 0.8% | |
| 61819.20117 | 1 | 0.1% | |
| 82425.60156 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 134910096 | 1 | 0.1% | |
| 116297026.5 | 1 | 0.1% | |
| 65404711.85 | 1 | 0.1% | |
| 41406182.86 | 1 | 0.1% | |
| 28189555.55 | 1 | 0.1% |
media_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with media_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9041231022 |
|---|---|
meses_ultima_contratacaco
Numeric
| Distinct count | 254 |
|---|---|
| Unique (%) | 12.7% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 40.5289793 |
|---|---|
| Minimum | 2.066666667 |
| Maximum | 325.5333333 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 2.066666667 |
|---|---|
| 5-th percentile | 3.253333333 |
| Q1 | 10 |
| Median | 32.73333333 |
| Q3 | 58.16666667 |
| 95-th percentile | 106.22 |
| Maximum | 325.5333333 |
| Range | 323.4666667 |
| Interquartile range | 48.16666667 |
Descriptive statistics
| Standard deviation | 37.4965795 |
|---|---|
| Coef of variation | 0.9251794678 |
| Kurtosis | 8.371662919 |
| Mean | 40.5289793 |
| MAD | 27.83812052 |
| Skewness | 2.033555459 |
| Sum | 18927.03333 |
| Variance | 1405.993474 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 5.966666667 | 13 | 0.7% | |
| 2.933333333 | 10 | 0.5% | |
| 40.5 | 9 | 0.4% | |
| 8.033333333 | 8 | 0.4% | |
| 26.23333333 | 7 | 0.4% | |
| 6.966666667 | 7 | 0.4% | |
| 39.46666667 | 7 | 0.4% | |
| 7 | 6 | 0.3% | |
| 23.2 | 6 | 0.3% | |
| 24.2 | 6 | 0.3% | |
| Other values (243) | 388 | 19.4% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 2.066666667 | 1 | 0.1% | |
| 2.133333333 | 1 | 0.1% | |
| 2.166666667 | 1 | 0.1% | |
| 2.3 | 1 | 0.1% | |
| 2.4 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 325.5333333 | 1 | 0.1% | |
| 222.0666667 | 1 | 0.1% | |
| 178.4 | 1 | 0.1% | |
| 168.2 | 1 | 0.1% | |
| 167.2333333 | 1 | 0.1% |
min_faturamento_est_coligados
Numeric
| Distinct count | 42 |
|---|---|
| Unique (%) | 2.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 446924.3031 |
|---|---|
| Minimum | 13946 |
| Maximum | 14465693 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 13946 |
|---|---|
| 5-th percentile | 83804.32 |
| Q1 | 210000 |
| Median | 210000 |
| Q3 | 210000 |
| 95-th percentile | 1112745.6 |
| Maximum | 14465693 |
| Range | 14451747 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 1052321.548 |
|---|---|
| Coef of variation | 2.354585645 |
| Kurtosis | 122.7846314 |
| Mean | 446924.3031 |
| MAD | 410365.8844 |
| Skewness | 9.858228338 |
| Sum | 117094167.4 |
| Variance | 1.107380641e+12 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 210000 | 159 | 8.0% | |
| 930000 | 15 | 0.8% | |
| 185457.6 | 13 | 0.7% | |
| 123638.4 | 13 | 0.7% | |
| 370915.2 | 6 | 0.3% | |
| 50000 | 5 | 0.2% | |
| 989107.2 | 4 | 0.2% | |
| 247276.8 | 4 | 0.2% | |
| 206064 | 3 | 0.1% | |
| 82425.6 | 2 | 0.1% | |
| Other values (31) | 38 | 1.9% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 13946 | 1 | 0.1% | |
| 19079 | 1 | 0.1% | |
| 41212.8 | 2 | 0.1% | |
| 50000 | 5 | 0.2% | |
| 51516 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 14465693 | 1 | 0.1% | |
| 4657046.5 | 1 | 0.1% | |
| 3791577.5 | 1 | 0.1% | |
| 3585513.5 | 1 | 0.1% | |
| 3193992 | 1 | 0.1% |
min_faturamento_est_coligados_gp
Numeric
| Distinct count | 56 |
|---|---|
| Unique (%) | 2.8% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 841766.8011 |
|---|---|
| Minimum | 19079 |
| Maximum | 81345780 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 19079 |
|---|---|
| 5-th percentile | 123638.4 |
| Q1 | 210000 |
| Median | 210000 |
| Q3 | 420000 |
| 95-th percentile | 1854576 |
| Maximum | 81345780 |
| Range | 81326701 |
| Interquartile range | 210000 |
Descriptive statistics
| Standard deviation | 5121975.978 |
|---|---|
| Coef of variation | 6.084792096 |
| Kurtosis | 236.4369519 |
| Mean | 841766.8011 |
| MAD | 999267.4416 |
| Skewness | 15.07431753 |
| Sum | 220542901.9 |
| Variance | 2.623463792e+13 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 210000 | 145 | 7.2% | |
| 930000 | 14 | 0.7% | |
| 123638.4 | 13 | 0.7% | |
| 185457.6 | 9 | 0.4% | |
| 420000 | 8 | 0.4% | |
| 370915.2 | 5 | 0.2% | |
| 50000 | 5 | 0.2% | |
| 630000 | 4 | 0.2% | |
| 247276.8 | 3 | 0.1% | |
| 1112745.6 | 3 | 0.1% | |
| Other values (45) | 53 | 2.6% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 19079 | 1 | 0.1% | |
| 41212.8 | 2 | 0.1% | |
| 50000 | 5 | 0.2% | |
| 51516 | 1 | 0.1% | |
| 61819.2 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 81345780 | 1 | 0.1% | |
| 14675693 | 1 | 0.1% | |
| 5608877 | 1 | 0.1% | |
| 4657046.5 | 1 | 0.1% | |
| 4450982.5 | 1 | 0.1% |
min_filiais_coligados
Highly correlated
This variable is highly correlated with min_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.942216953 |
|---|---|
min_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with min_filiais_coligados and should be ignored for analysis
| Correlation | 0.9051143156 |
|---|---|
min_meses_servicos
Highly correlated
This variable is highly correlated with meses_ultima_contratacaco and should be ignored for analysis
| Correlation | 0.9357049853 |
|---|---|
min_meses_servicos_all
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9830648574 |
|---|---|
min_vl_folha_coligados
Numeric
| Distinct count | 47 |
|---|---|
| Unique (%) | 2.4% |
| Missing (%) | 92.9% |
| Missing (n) | 1858 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1426195.058 |
|---|---|
| Minimum | 0 |
| Maximum | 134910100 |
| Zeros (%) | 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 41212.8 |
| Q1 | 61819.2 |
| Median | 206064 |
| Q3 | 494553.6 |
| 95-th percentile | 1421841.6 |
| Maximum | 134910100 |
| Range | 134910100 |
| Interquartile range | 432734.4 |
Descriptive statistics
| Standard deviation | 11334339.33 |
|---|---|
| Coef of variation | 7.947257469 |
| Kurtosis | 139.3003108 |
| Mean | 1426195.058 |
| MAD | 2115644.891 |
| Skewness | 11.7534269 |
| Sum | 202519698.3 |
| Variance | 1.284672481e+14 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 61819.2 | 28 | 1.4% | |
| 123638.4 | 12 | 0.6% | |
| 185457.6 | 8 | 0.4% | |
| 329702.4 | 6 | 0.3% | |
| 103032 | 5 | 0.2% | |
| 20606.4 | 5 | 0.2% | |
| 206064 | 5 | 0.2% | |
| 494553.6 | 4 | 0.2% | |
| 618192 | 4 | 0.2% | |
| 247276.8 | 4 | 0.2% | |
| Other values (36) | 61 | 3.0% | |
| (Missing) | 1858 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 20606.4 | 5 | 0.2% | |
| 41212.8 | 3 | 0.1% | |
| 61819.2 | 28 | 1.4% | |
| 82425.6 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 134910100 | 1 | 0.1% | |
| 10365019 | 1 | 0.1% | |
| 7232846.5 | 1 | 0.1% | |
| 2328523.2 | 1 | 0.1% | |
| 1916395.2 | 1 | 0.1% |
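The quantile statistics for min_vl_folha_coligados can be used to check how extreme the listed maxima are. The sketch below applies Tukey's 1.5 × IQR rule to the Q1 and Q3 values reported above. The rule itself is an assumption brought in for illustration; the report does not flag outliers this way.

```python
# Quartiles copied from the "Quantile statistics" table for min_vl_folha_coligados.
q1, q3 = 61819.2, 494553.6
iqr = q3 - q1                      # matches the reported interquartile range
upper_fence = q3 + 1.5 * iqr       # Tukey's upper fence (illustrative choice)

# The five largest values from the "Maximum 5 values" table.
top5 = [134910100, 10365019, 7232846.5, 2328523.2, 1916395.2]
flagged = [v for v in top5 if v > upper_fence]

print(f"IQR = {iqr:.1f}, upper fence = {upper_fence:.1f}")
print(f"{len(flagged)} of {len(top5)} listed maxima exceed the fence")
```

Under this rule all five listed maxima lie above the fence, which is consistent with the large skewness and kurtosis reported for the variable.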
min_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with min_vl_folha_coligados and should be ignored for analysis
| Correlation | 0.9817628137 |
|---|---|
natureza_juridica_macro
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| OUTROS | 1406 | 70.3% | |
| ENTIDADES EMPRESARIAIS | 426 | 21.3% | |
| ENTIDADES SEM FINS LUCRATIVOS | 137 | 6.9% | |
| ADMINISTRACAO PUBLICA | 15 | 0.8% | |
| CARGO POLITICO | 12 | 0.6% | |
| PESSOAS FISICAS | 4 | 0.2% |
| Max length | 29 |
|---|---|
| Mean length | 11.162 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_divisao
Categorical
| Distinct count | 72 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 749 | 37.5% | |
| ATIVIDADES DE ORGANIZACOES ASSOCIATIVAS | 140 | 7.0% | |
| ALIMENTACAO | 121 | 6.0% | |
| COMERCIO E REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 115 | 5.8% | |
| SERVICOS ESPECIALIZADOS PARA CONSTRUCAO | 73 | 3.6% | |
| COMERCIO POR ATACADO EXCETO VEICULOS AUTOMOTORES E MOTOCICLETAS | 63 | 3.1% | |
| OUTRAS ATIVIDADES DE SERVICOS PESSOAIS | 54 | 2.7% | |
| SERVICOS DE ESCRITORIO DE APOIO ADMINISTRATIVO E OUTROS SERVICOS PRESTADOS PRINCIPALMENTE AS EMPRESAS | 52 | 2.6% | |
| EDUCACAO | 48 | 2.4% | |
| CONSTRUCAO DE EDIFICIOS | 42 | 2.1% | |
| Other values (61) | 532 | 26.6% |
| Max length | 120 |
|---|---|
| Mean length | 33.6125 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_meso_regiao
Categorical
| Distinct count | 20 |
|---|---|
| Unique (%) | 1.0% |
| Missing (%) | 13.9% |
| Missing (n) | 278 |
| Value | Count | Frequency (%) | |
| CENTRO AMAZONENSE | 319 | 16.0% | |
| LESTE POTIGUAR | 264 | 13.2% | |
| NORTE MARANHENSE | 263 | 13.2% | |
| CENTRO NORTE PIAUIENSE | 165 | 8.2% | |
| OESTE MARANHENSE | 101 | 5.1% | |
| VALE DO ACRE | 83 | 4.2% | |
| OESTE POTIGUAR | 78 | 3.9% | |
| LESTE MARANHENSE | 77 | 3.9% | |
| SUDOESTE PIAUIENSE | 59 | 2.9% | |
| CENTRO MARANHENSE | 59 | 2.9% | |
| Other values (9) | 254 | 12.7% | |
| (Missing) | 278 | 13.9% |
| Max length | 22 |
|---|---|
| Mean length | 14.3615 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_micro_regiao
Categorical
| Distinct count | 73 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 13.9% |
| Missing (n) | 278 |
| Value | Count | Frequency (%) | |
| MANAUS | 256 | 12.8% | |
| NATAL | 216 | 10.8% | |
| AGLOMERACAO URBANA DE SAO LUIS | 206 | 10.3% | |
| TERESINA | 142 | 7.1% | |
| RIO BRANCO | 70 | 3.5% | |
| IMPERATRIZ | 56 | 2.8% | |
| MOSSORO | 39 | 1.9% | |
| MEDIO MEARIM | 35 | 1.8% | |
| CAXIAS | 32 | 1.6% | |
| PINDARE | 31 | 1.6% | |
| Other values (62) | 639 | 31.9% | |
| (Missing) | 278 | 13.9% |
| Max length | 33 |
|---|---|
| Mean length | 11.012 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_segmento
Categorical
| Distinct count | 21 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 927 | 46.4% | |
| OUTRAS ATIVIDADES DE SERVICOS | 223 | 11.2% | |
| INDUSTRIAS DE TRANSFORMACAO | 140 | 7.0% | |
| ALOJAMENTO E ALIMENTACAO | 133 | 6.7% | |
| CONSTRUCAO | 122 | 6.1% | |
| ATIVIDADES ADMINISTRATIVAS E SERVICOS COMPLEMENTARES | 102 | 5.1% | |
| ATIVIDADES PROFISSIONAIS CIENTIFICAS E TECNICAS | 78 | 3.9% | |
| TRANSPORTE ARMAZENAGEM E CORREIO | 59 | 2.9% | |
| SAUDE HUMANA E SERVICOS SOCIAIS | 50 | 2.5% | |
| EDUCACAO | 48 | 2.4% | |
| Other values (10) | 107 | 5.3% |
| Max length | 62 |
|---|---|
| Mean length | 42.663 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nu_meses_rescencia
Numeric
| Distinct count | 26 |
|---|---|
| Unique (%) | 1.3% |
| Missing (%) | 10.0% |
| Missing (n) | 199 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 25.31260411 |
|---|---|
| Minimum | 7 |
| Maximum | 54 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 22 |
| Median | 23 |
| Q3 | 25 |
| 95-th percentile | 48 |
| Maximum | 54 |
| Range | 47 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 9.684724624 |
|---|---|
| Coef of variation | 0.3826048313 |
| Kurtosis | 1.859388261 |
| Mean | 25.31260411 |
| MAD | 5.9891232 |
| Skewness | 1.277543731 |
| Sum | 45588 |
| Variance | 93.79389105 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 23 | 479 | 23.9% | |
| 22 | 357 | 17.8% | |
| 24 | 197 | 9.8% | |
| 48 | 125 | 6.2% | |
| 25 | 115 | 5.8% | |
| 26 | 113 | 5.7% | |
| 27 | 87 | 4.3% | |
| 21 | 51 | 2.5% | |
| 9 | 48 | 2.4% | |
| 47 | 39 | 1.9% | |
| Other values (15) | 190 | 9.5% | |
| (Missing) | 199 | 10.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 7 | 39 | 1.9% | |
| 8 | 17 | 0.9% | |
| 9 | 48 | 2.4% | |
| 10 | 16 | 0.8% | |
| 11 | 17 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 54 | 10 | 0.5% | |
| 52 | 3 | 0.1% | |
| 50 | 24 | 1.2% | |
| 49 | 20 | 1.0% | |
| 48 | 125 | 6.2% |
percent_func_genero_fem
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.9457217032 |
|---|---|
percent_func_genero_masc
Numeric
| Distinct count | 65 |
|---|---|
| Unique (%) | 3.2% |
| Missing (%) | 83.0% |
| Missing (n) | 1659 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 53.98489736 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 4.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 12.5 |
| Median | 57.14 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 87.5 |
Descriptive statistics
| Standard deviation | 39.18282514 |
|---|---|
| Coef of variation | 0.7258108667 |
| Kurtosis | -1.493931653 |
| Mean | 53.98489736 |
| MAD | 34.86371118 |
| Skewness | -0.181502856 |
| Sum | 18408.85 |
| Variance | 1535.293786 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 100 | 94 | 4.7% | |
| 0 | 83 | 4.2% | |
| 50 | 28 | 1.4% | |
| 66.67 | 15 | 0.8% | |
| 33.33 | 13 | 0.7% | |
| 75 | 8 | 0.4% | |
| 83.33 | 7 | 0.4% | |
| 60 | 7 | 0.4% | |
| 25 | 7 | 0.4% | |
| 80 | 4 | 0.2% | |
| Other values (54) | 75 | 3.8% | |
| (Missing) | 1659 | 83.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 83 | 4.2% | |
| 11.11 | 2 | 0.1% | |
| 12.5 | 1 | 0.1% | |
| 14.29 | 1 | 0.1% | |
| 16 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 94 | 4.7% | |
| 98.04 | 1 | 0.1% | |
| 93.33 | 2 | 0.1% | |
| 93.24 | 1 | 0.1% | |
| 92.31 | 1 | 0.1% |
qt_admitidos
Numeric
| Distinct count | 85 |
|---|---|
| Unique (%) | 4.2% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 35.05567452 |
|---|---|
| Minimum | 1 |
| Maximum | 2000 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| Median | 6 |
| Q3 | 18 |
| 95-th percentile | 147.8 |
| Maximum | 2000 |
| Range | 1999 |
| Interquartile range | 16 |
Descriptive statistics
| Standard deviation | 144.849282 |
|---|---|
| Coef of variation | 4.131978174 |
| Kurtosis | 124.459545 |
| Mean | 35.05567452 |
| MAD | 46.63998643 |
| Skewness | 10.32530733 |
| Sum | 16371 |
| Variance | 20981.31449 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 55 | 2.8% | |
| 3 | 33 | 1.7% | |
| 4 | 30 | 1.5% | |
| 6 | 22 | 1.1% | |
| 5 | 21 | 1.1% | |
| 8 | 17 | 0.9% | |
| 7 | 17 | 0.9% | |
| 9 | 15 | 0.8% | |
| 17 | 11 | 0.5% | |
| Other values (74) | 171 | 8.6% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 55 | 2.8% | |
| 3 | 33 | 1.7% | |
| 4 | 30 | 1.5% | |
| 5 | 21 | 1.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2000 | 1 | 0.1% | |
| 1812 | 1 | 0.1% | |
| 890 | 1 | 0.1% | |
| 687 | 1 | 0.1% | |
| 647 | 1 | 0.1% |
qt_admitidos_12meses
Numeric
| Distinct count | 22 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.209850107 |
|---|---|
| Minimum | 0 |
| Maximum | 65 |
| Zeros (%) | 17.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 6 |
| Maximum | 65 |
| Range | 65 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 4.841131596 |
|---|---|
| Coef of variation | 4.001430894 |
| Kurtosis | 87.57457722 |
| Mean | 1.209850107 |
| MAD | 1.830445369 |
| Skewness | 8.343214083 |
| Sum | 565 |
| Variance | 23.43655513 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 342 | 17.1% | |
| 1 | 65 | 3.2% | |
| 2 | 16 | 0.8% | |
| 3 | 10 | 0.5% | |
| 4 | 7 | 0.4% | |
| 6 | 6 | 0.3% | |
| 12 | 4 | 0.2% | |
| 5 | 2 | 0.1% | |
| 7 | 2 | 0.1% | |
| 14 | 2 | 0.1% | |
| Other values (11) | 11 | 0.5% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 342 | 17.1% | |
| 1 | 65 | 3.2% | |
| 2 | 16 | 0.8% | |
| 3 | 10 | 0.5% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 65 | 1 | 0.1% | |
| 46 | 1 | 0.1% | |
| 35 | 1 | 0.1% | |
| 28 | 1 | 0.1% | |
| 21 | 1 | 0.1% |
qt_alteracao_socio_180d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|---|
qt_alteracao_socio_365d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|---|
qt_alteracao_socio_90d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|---|
qt_alteracao_socio_total
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|---|
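The four qt_alteracao_socio_* variables are reported as constant with value nan, meaning the columns are entirely missing. A minimal sketch of how such columns could be detected and dropped with pandas follows. The helper name `constant_columns` and the toy values are hypothetical; only the column names come from this report.

```python
import numpy as np
import pandas as pd


def constant_columns(df: pd.DataFrame) -> list:
    """Return columns that hold at most one distinct value, NaN included."""
    # dropna=False counts NaN as a value, so an all-NaN column has 1 distinct value.
    n_distinct = df.nunique(dropna=False)
    return n_distinct[n_distinct <= 1].index.tolist()


# Toy frame: one all-NaN column, one constant boolean, one varying count.
demo = pd.DataFrame({
    "qt_alteracao_socio_total": [np.nan, np.nan, np.nan],
    "fl_epp": [False, False, False],
    "qt_filiais": [0, 1, 0],
})
to_drop = constant_columns(demo)
cleaned = demo.drop(columns=to_drop)
```

Counting NaN as a value is a deliberate choice here: with the default `dropna=True`, an all-NaN column reports zero distinct values, while a column with one value plus some missing cells would report one and be treated as constant.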
qt_art
Numeric
| Distinct count | 8 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 98.7% |
| Missing (n) | 1974 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.846153846 |
|---|---|
| Minimum | 1 |
| Maximum | 14 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 3 |
| 95-th percentile | 10.25 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 3.270379889 |
|---|---|
| Coef of variation | 1.149052393 |
| Kurtosis | 5.838414503 |
| Mean | 2.846153846 |
| MAD | 2.094674556 |
| Skewness | 2.474656468 |
| Sum | 74 |
| Variance | 10.69538462 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 12 | 0.6% | |
| 2 | 6 | 0.3% | |
| 3 | 4 | 0.2% | |
| 14 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| (Missing) | 1974 | 98.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 12 | 0.6% | |
| 2 | 6 | 0.3% | |
| 3 | 4 | 0.2% | |
| 5 | 1 | 0.1% | |
| 8 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 14 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 3 | 4 | 0.2% |
qt_coligadas
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 89.5% |
| Missing (n) | 1791 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.291866029 |
|---|---|
| Minimum | 1 |
| Maximum | 26 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 26 |
| Range | 25 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.688290288 |
|---|---|
| Coef of variation | 1.172970084 |
| Kurtosis | 31.34631573 |
| Mean | 2.291866029 |
| MAD | 1.599185 |
| Skewness | 4.591695706 |
| Sum | 479 |
| Variance | 7.226904674 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 121 | 6.0% | |
| 2 | 37 | 1.8% | |
| 3 | 18 | 0.9% | |
| 5 | 10 | 0.5% | |
| 4 | 6 | 0.3% | |
| 7 | 5 | 0.2% | |
| 6 | 4 | 0.2% | |
| 8 | 3 | 0.1% | |
| 26 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| Other values (3) | 3 | 0.1% | |
| (Missing) | 1791 | 89.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 121 | 6.0% | |
| 2 | 37 | 1.8% | |
| 3 | 18 | 0.9% | |
| 4 | 6 | 0.3% | |
| 5 | 10 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 26 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 13 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 9 | 1 | 0.1% |
qt_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.9950754524 |
|---|---|
qt_coligados_agropecuaria
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 248 | 12.4% | |
| 1 | 11 | 0.5% | |
| 2 | 3 | 0.1% | |
| 4 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
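qt_coligados_agropecuaria is typed as Categorical even though its values are counts. The length statistics (every value exactly 3 characters, containing digits and a non-word character) suggest the values may have been read as strings such as "0.0". That is an inference from the table, not something the report states. Under that assumption, a sketch of converting such a column to a nullable integer type:

```python
import pandas as pd

# Hypothetical raw values mirroring the distinct values listed above (0, 1, 2, 4).
raw = pd.Series(["0.0", "1.0", None, "2.0", "4.0"], name="qt_coligados_agropecuaria")

# errors="coerce" turns unparseable entries into NaN instead of raising.
# "Int64" (capital I) is the nullable integer dtype, so missing cells become <NA>.
counts = pd.to_numeric(raw, errors="coerce").astype("Int64")
```

After conversion the column would be profiled as Numeric, like qt_coligados_ccivil and the other count variables. The same reasoning would apply to other count columns typed as Categorical with the same length statistics.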
qt_coligados_atividade_alto
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_atividade_baixo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_atividade_inativo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_atividade_medio
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_atividade_mt_baixo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_ativo
Highly correlated
This variable is highly correlated with qt_coligados and should be ignored for analysis
| Correlation | 0.9922537325 |
|---|---|
qt_coligados_baixada
Highly correlated
This variable is highly correlated with min_vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 0.9776137125 |
|---|---|
qt_coligados_ccivil
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.3536121673 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros (%) | 10.6% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.9330939842 |
|---|---|
| Coef of variation | 2.638749654 |
| Kurtosis | 16.15012997 |
| Mean | 0.3536121673 |
| MAD | 0.5700819731 |
| Skewness | 3.727535875 |
| Sum | 93 |
| Variance | 0.8706643834 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 212 | 10.6% | |
| 1 | 30 | 1.5% | |
| 2 | 12 | 0.6% | |
| 3 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 6 | 2 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 212 | 10.6% | |
| 1 | 30 | 1.5% | |
| 2 | 12 | 0.6% | |
| 3 | 4 | 0.2% | |
| 5 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6 | 2 | 0.1% | |
| 5 | 3 | 0.1% | |
| 3 | 4 | 0.2% | |
| 2 | 12 | 0.6% | |
| 1 | 30 | 1.5% |
qt_coligados_centro
Numeric
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.08745247148 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros (%) | 12.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.6021228133 |
|---|---|
| Coef of variation | 6.885143474 |
| Kurtosis | 74.31706993 |
| Mean | 0.08745247148 |
| MAD | 0.1695846405 |
| Skewness | 8.314029607 |
| Sum | 23 |
| Variance | 0.3625518823 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 255 | 12.8% | |
| 1 | 3 | 0.1% | |
| 3 | 2 | 0.1% | |
| 6 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 255 | 12.8% | |
| 1 | 3 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 2 | 0.1% | |
| 6 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6 | 2 | 0.1% | |
| 3 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| 1 | 3 | 0.1% | |
| 0 | 255 | 12.8% |
qt_coligados_comercio
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.9961977186 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros (%) | 7.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.061086386 |
|---|---|
| Coef of variation | 2.068953128 |
| Kurtosis | 32.12491127 |
| Mean | 0.9961977186 |
| MAD | 1.090893319 |
| Skewness | 4.767635124 |
| Sum | 262 |
| Variance | 4.248077091 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 144 | 7.2% | |
| 1 | 70 | 3.5% | |
| 2 | 27 | 1.4% | |
| 3 | 7 | 0.4% | |
| 8 | 3 | 0.1% | |
| 6 | 3 | 0.1% | |
| 4 | 2 | 0.1% | |
| 5 | 2 | 0.1% | |
| 10 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| Other values (3) | 3 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 144 | 7.2% | |
| 1 | 70 | 3.5% | |
| 2 | 27 | 1.4% | |
| 3 | 7 | 0.4% | |
| 4 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 20 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| 8 | 3 | 0.1% |
qt_coligados_epp
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_exterior
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 257 | 12.8% | |
| 2 | 3 | 0.1% | |
| 1 | 2 | 0.1% | |
| 3 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_inapta
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 258 | 12.9% | |
| 1 | 3 | 0.1% | |
| 2 | 2 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_industria
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.6425855513 |
|---|---|
| Minimum | 0 |
| Maximum | 111 |
| Zeros (%) | 11.5% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 111 |
| Range | 111 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 6.902926429 |
|---|---|
| Coef of variation | 10.74242397 |
| Kurtosis | 252.1246012 |
| Mean | 0.6425855513 |
| MAD | 1.11902731 |
| Skewness | 15.74305704 |
| Sum | 169 |
| Variance | 47.65039329 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 229 | 11.5% | |
| 1 | 24 | 1.2% | |
| 3 | 4 | 0.2% | |
| 2 | 4 | 0.2% | |
| 111 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 229 | 11.5% | |
| 1 | 24 | 1.2% | |
| 2 | 4 | 0.2% | |
| 3 | 4 | 0.2% | |
| 14 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 111 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 3 | 4 | 0.2% | |
| 2 | 4 | 0.2% | |
| 1 | 24 | 1.2% |
qt_coligados_ltda
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 251 | 12.6% | |
| 1 | 9 | 0.4% | |
| 2 | 3 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_matriz
Highly correlated
This variable is highly correlated with qt_coligados_ativo and should be ignored for analysis
| Correlation | 0.9940913825 |
|---|---|
qt_coligados_me
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 262 | 13.1% | |
| 1 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_mei
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 255 | 12.8% | |
| 1 | 7 | 0.4% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_nordeste
Numeric
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2 |
|---|---|
| Minimum | 0 |
| Maximum | 36 |
| Zeros (%) | 4.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 4.009530631 |
|---|---|
| Coef of variation | 2.004765315 |
| Kurtosis | 34.70289867 |
| Mean | 2 |
| MAD | 2.038022814 |
| Skewness | 5.240064904 |
| Sum | 526 |
| Variance | 16.07633588 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 93 | 4.7% | |
| 1 | 82 | 4.1% | |
| 2 | 38 | 1.9% | |
| 3 | 13 | 0.7% | |
| 5 | 8 | 0.4% | |
| 6 | 6 | 0.3% | |
| 7 | 6 | 0.3% | |
| 4 | 5 | 0.2% | |
| 8 | 5 | 0.2% | |
| 16 | 1 | 0.1% | |
| Other values (6) | 6 | 0.3% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 93 | 4.7% | |
| 1 | 82 | 4.1% | |
| 2 | 38 | 1.9% | |
| 3 | 13 | 0.7% | |
| 4 | 5 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 36 | 1 | 0.1% | |
| 31 | 1 | 0.1% | |
| 25 | 1 | 0.1% | |
| 20 | 1 | 0.1% | |
| 16 | 1 | 0.1% |
qt_coligados_norte
Highly correlated
This variable is highly correlated with idade_ate_18 and should be ignored for analysis
| Correlation | 0.9001915505 |
|---|---|
qt_coligados_nula
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_sa
Highly correlated
This variable is highly correlated with qt_coligados_matriz and should be ignored for analysis
| Correlation | 0.9176972372 |
|---|---|
qt_coligados_serviço
Numeric
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.768060837 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros (%) | 4.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 2.890397177 |
|---|---|
| Coef of variation | 1.63478378 |
| Kurtosis | 14.51365162 |
| Mean | 1.768060837 |
| MAD | 1.804739117 |
| Skewness | 3.352582746 |
| Sum | 465 |
| Variance | 8.354395844 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 96 | 4.8% | |
| 1 | 88 | 4.4% | |
| 2 | 27 | 1.4% | |
| 3 | 15 | 0.8% | |
| 7 | 9 | 0.4% | |
| 4 | 7 | 0.4% | |
| 5 | 6 | 0.3% | |
| 6 | 4 | 0.2% | |
| 8 | 4 | 0.2% | |
| 17 | 1 | 0.1% | |
| Other values (6) | 6 | 0.3% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 96 | 4.8% | |
| 1 | 88 | 4.4% | |
| 2 | 27 | 1.4% | |
| 3 | 15 | 0.8% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 20 | 1 | 0.1% | |
| 19 | 1 | 0.1% | |
| 17 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| 11 | 1 | 0.1% |
qt_coligados_sudeste
Highly correlated
This variable is highly correlated with qt_coligados_sa and should be ignored for analysis
| Correlation | 0.9150368291 |
|---|---|
qt_coligados_sul
Numeric
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.04562737643 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros (%) | 12.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.3880489044 |
|---|---|
| Coef of variation | 8.504738489 |
| Kurtosis | 116.6502915 |
| Mean | 0.04562737643 |
| MAD | 0.08951987162 |
| Skewness | 10.30566678 |
| Sum | 12 |
| Variance | 0.1505819522 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 258 | 12.9% | |
| 1 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 258 | 12.9% | |
| 1 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 5 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 2 | 1 | 0.1% | |
| 1 | 2 | 0.1% | |
| 0 | 258 | 12.9% |
qt_coligados_suspensa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 0 | 263 | 13.2% | |
| (Missing) | 1737 | 86.9% |
qt_desligados
Numeric
| Distinct count | 76 |
|---|---|
| Unique (%) | 3.8% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 25.1006424 |
|---|---|
| Minimum | 0 |
| Maximum | 1985 |
| Zeros (%) | 2.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| Median | 4 |
| Q3 | 12 |
| 95-th percentile | 107.1 |
| Maximum | 1985 |
| Range | 1985 |
| Interquartile range | 11 |
Descriptive statistics
| Standard deviation | 112.3499311 |
|---|---|
| Coef of variation | 4.475978316 |
| Kurtosis | 207.5163615 |
| Mean | 25.1006424 |
| MAD | 34.37883616 |
| Skewness | 12.95798816 |
| Sum | 11722 |
| Variance | 12622.50702 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 81 | 4.0% | |
| 2 | 57 | 2.9% | |
| 0 | 54 | 2.7% | |
| 4 | 32 | 1.6% | |
| 3 | 26 | 1.3% | |
| 6 | 24 | 1.2% | |
| 5 | 17 | 0.9% | |
| 7 | 15 | 0.8% | |
| 11 | 14 | 0.7% | |
| 8 | 11 | 0.5% | |
| Other values (65) | 136 | 6.8% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 54 | 2.7% | |
| 1 | 81 | 4.0% | |
| 2 | 57 | 2.9% | |
| 3 | 26 | 1.3% | |
| 4 | 32 | 1.6% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1985 | 1 | 0.1% | |
| 865 | 1 | 0.1% | |
| 585 | 1 | 0.1% | |
| 451 | 2 | 0.1% | |
| 290 | 1 | 0.1% |
qt_desligados_12meses
Numeric
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.468950749 |
|---|---|
| Minimum | 0 |
| Maximum | 233 |
| Zeros (%) | 16.6% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 233 |
| Range | 233 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 11.1897696 |
|---|---|
| Coef of variation | 7.61752537 |
| Kurtosis | 395.8513046 |
| Mean | 1.468950749 |
| MAD | 2.211124816 |
| Skewness | 19.24610978 |
| Sum | 686 |
| Variance | 125.2109437 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 332 | 16.6% | |
| 1 | 61 | 3.0% | |
| 2 | 23 | 1.1% | |
| 3 | 17 | 0.9% | |
| 4 | 11 | 0.5% | |
| 6 | 6 | 0.3% | |
| 5 | 4 | 0.2% | |
| 10 | 3 | 0.1% | |
| 9 | 2 | 0.1% | |
| 25 | 2 | 0.1% | |
| Other values (6) | 6 | 0.3% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 332 | 16.6% | |
| 1 | 61 | 3.0% | |
| 2 | 23 | 1.1% | |
| 3 | 17 | 0.9% | |
| 4 | 11 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 233 | 1 | 0.1% | |
| 40 | 1 | 0.1% | |
| 25 | 2 | 0.1% | |
| 24 | 1 | 0.1% | |
| 14 | 1 | 0.1% |
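The descriptive statistics for qt_desligados_12meses can be cross-checked against each other. The following is a minimal sketch, using only values copied from the tables above plus the 2000 observations reported in the dataset summary; the variable names are illustrative and are not part of the report.

```python
# Values copied from the qt_desligados_12meses tables above.
n_total = 2000        # observations in the dataset (dataset summary)
n_missing = 1533      # "Missing (n)" row
mean = 1.468950749    # "Mean" row
std = 11.1897696      # "Standard deviation" row
total = 686           # "Sum" row

n_valid = n_total - n_missing      # non-missing observations
coef_of_variation = std / mean     # should match the "Coef of variation" row
variance = std ** 2                # should match the "Variance" row
mean_from_sum = total / n_valid    # should reproduce the "Mean" row

print(n_valid, coef_of_variation, variance, mean_from_sum)
```

The check suggests that the mean, sum, and dispersion figures are computed over the non-missing values only, not over all 2000 rows.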
qt_ex_funcionarios
Highly correlated
This variable is highly correlated with qt_desligados and should be ignored for analysis
| Correlation | 0.9999961944 |
|---|
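The "Highly correlated" rejections in this report pair each rejected variable with another variable whose correlation exceeds some cutoff. The sketch below shows one way such a filter could be written with pandas. It is not the profiler's own implementation: the function name, the 0.9 threshold, and the rule of comparing only against columns that were kept are assumptions made for illustration.

```python
import pandas as pd


def highly_correlated_columns(df: pd.DataFrame, threshold: float = 0.9) -> dict:
    """Map each rejected column to an earlier, kept column it correlates with.

    Assumption: Pearson correlation on numeric columns, absolute value
    compared against `threshold`. The report's exact rule may differ.
    """
    corr = df.select_dtypes(include="number").corr().abs()
    cols = list(corr.columns)
    rejected = {}
    for i, col in enumerate(cols):
        for other in cols[:i]:
            if other in rejected:
                continue  # only compare against columns that were kept
            if corr.loc[col, other] > threshold:
                rejected[col] = other
                break
    return rejected


# Toy data, not taken from the profiled dataset.
toy = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.0, 4.1, 6.0, 8.2, 10.0],  # nearly 2 * a
    "c": [5.0, 1.0, 4.0, 2.0, 3.0],   # weakly related to a
})
rejected = highly_correlated_columns(toy)
reduced = toy.drop(columns=list(rejected))
print(rejected, list(reduced.columns))
```

Unlike this sketch, the report sometimes pairs a rejected variable with another rejected variable (for example qt_funcionarios_12meses with qt_funcionarios), so the column order and comparison rule used by the profiler are evidently different.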
qt_filiais
Numeric
| Distinct count | 42 |
|---|---|
| Unique (%) | 2.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 13.996 |
|---|---|
| Minimum | 0 |
| Maximum | 9270 |
| Zeros (%) | 90.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 9270 |
| Range | 9270 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 284.4729995 |
|---|---|
| Coef of variation | 20.32530719 |
| Kurtosis | 830.7848278 |
| Mean | 13.996 |
| MAD | 27.18814 |
| Skewness | 27.83323791 |
| Sum | 27992 |
| Variance | 80924.88743 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1818 | 90.9% | |
| 1 | 87 | 4.3% | |
| 2 | 26 | 1.3% | |
| 3 | 12 | 0.6% | |
| 4 | 8 | 0.4% | |
| 5 | 3 | 0.1% | |
| 8 | 3 | 0.1% | |
| 9 | 3 | 0.1% | |
| 59 | 2 | 0.1% | |
| 7 | 2 | 0.1% | |
| Other values (32) | 36 | 1.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1818 | 90.9% | |
| 1 | 87 | 4.3% | |
| 2 | 26 | 1.3% | |
| 3 | 12 | 0.6% | |
| 4 | 8 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 9270 | 1 | 0.1% | |
| 7687 | 1 | 0.1% | |
| 2537 | 1 | 0.1% | |
| 2186 | 1 | 0.1% | |
| 1715 | 1 | 0.1% |
qt_funcionarios
Highly correlated
This variable is highly correlated with idade_de_44_a_48 and should be ignored for analysis
| Correlation | 0.9521282594 |
|---|
qt_funcionarios_12meses
Highly correlated
This variable is highly correlated with qt_funcionarios and should be ignored for analysis
| Correlation | 0.9870025118 |
|---|
qt_funcionarios_24meses
Highly correlated
This variable is highly correlated with qt_funcionarios_12meses and should be ignored for analysis
| Correlation | 0.9812462049 |
|---|
qt_funcionarios_coligados
Highly correlated
This variable is highly correlated with qt_coligados_sudeste and should be ignored for analysis
| Correlation | 0.9073230395 |
|---|
qt_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9739337033 |
|---|
qt_funcionarios_grupo
Highly correlated
This variable is highly correlated with qt_filiais and should be ignored for analysis
| Correlation | 0.9003805679 |
|---|
qt_ramos_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.9442422997 |
|---|
qt_regioes_coligados
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Value | Count | Frequency (%) | |
| 1 | 232 | 11.6% | |
| 2 | 22 | 1.1% | |
| 4 | 5 | 0.2% | |
| 3 | 4 | 0.2% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
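qt_regioes_coligados is a count, yet it is typed as Categorical and every value has length 3. One possible explanation, which is an assumption and not something the report states, is that the column holds strings such as "1.0". Under that assumption, a minimal conversion to a nullable integer column could look like this; the sample series is made up.

```python
import pandas as pd

# Hypothetical sample imitating string-encoded counts with a missing entry.
raw = pd.Series(["1.0", "2.0", None, "4.0"], name="qt_regioes_coligados")

# Parse to float first (unparseable entries become NaN), then to nullable Int64.
converted = pd.to_numeric(raw, errors="coerce").astype("Int64")
print(converted)
```

The same approach would apply to qt_socios_pj and qt_socios_st_suspensa if their values turn out to be encoded the same way.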
qt_socios
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 25.1% |
| Missing (n) | 502 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.394526035 |
|---|---|
| Minimum | 1 |
| Maximum | 37 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 2.15 |
| Maximum | 37 |
| Range | 36 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 1.349889308 |
|---|---|
| Coef of variation | 0.9679914714 |
| Kurtosis | 350.9905593 |
| Mean | 1.394526035 |
| MAD | 0.5883652614 |
| Skewness | 15.30130766 |
| Sum | 2089 |
| Variance | 1.822201145 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 1117 | 55.9% | |
| 2 | 306 | 15.3% | |
| 3 | 42 | 2.1% | |
| 4 | 12 | 0.6% | |
| 5 | 8 | 0.4% | |
| 6 | 4 | 0.2% | |
| 9 | 2 | 0.1% | |
| 8 | 2 | 0.1% | |
| 19 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| Other values (3) | 3 | 0.1% | |
| (Missing) | 502 | 25.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 1117 | 55.9% | |
| 2 | 306 | 15.3% | |
| 3 | 42 | 2.1% | |
| 4 | 12 | 0.6% | |
| 5 | 8 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 37 | 1 | 0.1% | |
| 19 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 9 | 2 | 0.1% |
qt_socios_coligados
Highly correlated
This variable is highly correlated with qt_funcionarios_coligados and should be ignored for analysis
| Correlation | 0.9113228208 |
|---|
qt_socios_feminino
Numeric
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 68.8% |
| Missing (n) | 1377 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.109149278 |
|---|---|
| Minimum | 1 |
| Maximum | 11 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.5328106767 |
|---|---|
| Coef of variation | 0.4803777881 |
| Kurtosis | 196.6841777 |
| Mean | 1.109149278 |
| MAD | 0.2011290061 |
| Skewness | 11.74412016 |
| Sum | 691 |
| Variance | 0.2838872172 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 574 | 28.7% | |
| 2 | 40 | 2.0% | |
| 3 | 7 | 0.4% | |
| 5 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| (Missing) | 1377 | 68.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 574 | 28.7% | |
| 2 | 40 | 2.0% | |
| 3 | 7 | 0.4% | |
| 5 | 1 | 0.1% | |
| 11 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 11 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 3 | 7 | 0.4% | |
| 2 | 40 | 2.0% | |
| 1 | 574 | 28.7% |
qt_socios_masculino
Numeric
| Distinct count | 9 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 57.0% |
| Missing (n) | 1139 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.221835075 |
|---|---|
| Minimum | 1 |
| Maximum | 32 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 32 |
| Range | 31 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 1.208799857 |
|---|---|
| Coef of variation | 0.9893314417 |
| Kurtosis | 491.4704133 |
| Mean | 1.221835075 |
| MAD | 0.3849263679 |
| Skewness | 19.84665356 |
| Sum | 1052 |
| Variance | 1.461197094 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 747 | 37.4% | |
| 2 | 89 | 4.5% | |
| 3 | 12 | 0.6% | |
| 4 | 6 | 0.3% | |
| 5 | 3 | 0.1% | |
| 6 | 2 | 0.1% | |
| 8 | 1 | 0.1% | |
| 32 | 1 | 0.1% | |
| (Missing) | 1139 | 57.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 747 | 37.4% | |
| 2 | 89 | 4.5% | |
| 3 | 12 | 0.6% | |
| 4 | 6 | 0.3% | |
| 5 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 32 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 6 | 2 | 0.1% | |
| 5 | 3 | 0.1% | |
| 4 | 6 | 0.3% |
qt_socios_pep
Highly correlated
This variable is highly correlated with qt_socios_masculino and should be ignored for analysis
| Correlation | 0.9867818247 |
|---|
qt_socios_pf
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 0.9309382851 |
|---|
qt_socios_pj
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 25.1% |
| Missing (n) | 502 |
| Value | Count | Frequency (%) | |
| 0 | 1479 | 74.0% | |
| 1 | 12 | 0.6% | |
| 2 | 6 | 0.3% | |
| 3 | 1 | 0.1% | |
| (Missing) | 502 | 25.1% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_socios_pj_ativos
Highly correlated
This variable is highly correlated with qt_socios_pj and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_pj_baixados
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 1981 |
| Value | Count | Frequency (%) | |
| 0 | 19 | 0.9% | |
| (Missing) | 1981 | 99.1% |
qt_socios_pj_inaptos
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 1981 |
| Value | Count | Frequency (%) | |
| 0 | 19 | 0.9% | |
| (Missing) | 1981 | 99.1% |
qt_socios_pj_nulos
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 1981 |
| Value | Count | Frequency (%) | |
| 0 | 19 | 0.9% | |
| (Missing) | 1981 | 99.1% |
qt_socios_pj_suspensos
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.1% |
| Missing (n) | 1981 |
| Value | Count | Frequency (%) | |
| 0 | 19 | 0.9% | |
| (Missing) | 1981 | 99.1% |
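The four qt_socios_pj_* status columns above each have 1981 of 2000 values missing (99.1%) and a single observed value, 0. A simple way to list such columns is to compute the share of missing values per column and compare it with a cutoff. The sketch below assumes pandas; the function name and the 0.95 cutoff are illustrative choices, not settings taken from the report.

```python
import numpy as np
import pandas as pd


def mostly_missing(df: pd.DataFrame, threshold: float = 0.95) -> pd.Series:
    """Return the missing share of every column whose share exceeds `threshold`."""
    share = df.isna().mean()  # fraction of NaN per column
    return share[share > threshold].sort_values(ascending=False)


# Toy frame: one column with 99 of 100 values missing, one complete column.
toy = pd.DataFrame({
    "mostly_missing": [0.0] + [np.nan] * 99,
    "complete": list(range(100)),
})
flagged = mostly_missing(toy)
print(flagged)
```

Whether to drop or keep a flagged column is a modelling decision; a missing value here may itself carry information (for example, no corporate partners on record).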
qt_socios_st_regular
Highly correlated
This variable is highly correlated with qt_socios_pf and should be ignored for analysis
| Correlation | 0.9649092969 |
|---|
qt_socios_st_suspensa
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.3% |
| Missing (n) | 1986 |
| Value | Count | Frequency (%) | |
| 1 | 13 | 0.7% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1986 | 99.3% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_ufs_coligados
Numeric
| Distinct count | 9 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.338403042 |
|---|---|
| Minimum | 1 |
| Maximum | 8 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.9707204558 |
|---|---|
| Coef of variation | 0.7252826133 |
| Kurtosis | 17.29454941 |
| Mean | 1.338403042 |
| MAD | 0.5610027614 |
| Skewness | 3.894238964 |
| Sum | 352 |
| Variance | 0.9422982033 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 218 | 10.9% | |
| 2 | 27 | 1.4% | |
| 3 | 6 | 0.3% | |
| 5 | 5 | 0.2% | |
| 4 | 4 | 0.2% | |
| 6 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 218 | 10.9% | |
| 2 | 27 | 1.4% | |
| 3 | 6 | 0.3% | |
| 4 | 4 | 0.2% | |
| 5 | 5 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 8 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| 6 | 1 | 0.1% | |
| 5 | 5 | 0.2% | |
| 4 | 4 | 0.2% |
setor
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| COMERCIO | 927 | 46.4% | |
| SERVIÇO | 792 | 39.6% | |
| INDUSTRIA | 134 | 6.7% | |
| CONSTRUÇÃO CIVIL | 122 | 6.1% | |
| AGROPECUARIA | 14 | 0.7% | |
| (Missing) | 11 | 0.5% |
| Max length | 16 |
|---|---|
| Mean length | 8.1595 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
sg_uf
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| MA | 534 | 26.7% | |
| RN | 415 | 20.8% | |
| AM | 353 | 17.6% | |
| PI | 328 | 16.4% | |
| RO | 266 | 13.3% | |
| AC | 104 | 5.2% |
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sg_uf_matriz
Categorical
| Distinct count | 18 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Value | Count | Frequency (%) | |
| MA | 523 | 26.2% | |
| RN | 410 | 20.5% | |
| AM | 347 | 17.3% | |
| PI | 320 | 16.0% | |
| RO | 259 | 13.0% | |
| AC | 99 | 5.0% | |
| DF | 7 | 0.4% | |
| SP | 5 | 0.2% | |
| RJ | 4 | 0.2% | |
| CE | 3 | 0.1% | |
| Other values (7) | 12 | 0.6% | |
| (Missing) | 11 | 0.5% |
| Max length | 3 |
|---|---|
| Mean length | 2.0055 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sum_faturamento_estimado_coligadas
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.9984290135 |
|---|
total
Highly correlated
This variable is highly correlated with qt_funcionarios_24meses and should be ignored for analysis
| Correlation | 0.9914053461 |
|---|
total_filiais_coligados
Highly correlated
This variable is highly correlated with qt_socios_coligados and should be ignored for analysis
| Correlation | 0.9651560413 |
|---|
tx_crescimento_12meses
Numeric
| Distinct count | 69 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 82.9% |
| Missing (n) | 1658 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -3.746989236 |
|---|---|
| Minimum | -100 |
| Maximum | 216.6666667 |
| Zeros (%) | 10.5% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -50 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 35.98571429 |
| Maximum | 216.6666667 |
| Range | 316.6666667 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 31.37130232 |
|---|---|
| Coef of variation | -8.372402571 |
| Kurtosis | 10.55088936 |
| Mean | -3.746989236 |
| MAD | 16.56969509 |
| Skewness | 0.5035561138 |
| Sum | -1281.470319 |
| Variance | 984.1586091 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 210 | 10.5% | |
| -100 | 12 | 0.6% | |
| -33.33333333 | 10 | 0.5% | |
| -50 | 9 | 0.4% | |
| 50 | 8 | 0.4% | |
| -25 | 7 | 0.4% | |
| -14.28571429 | 5 | 0.2% | |
| 25 | 5 | 0.2% | |
| 100 | 3 | 0.1% | |
| 33.33333333 | 3 | 0.1% | |
| Other values (58) | 70 | 3.5% | |
| (Missing) | 1658 | 82.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 12 | 0.6% | |
| -93.82716049 | 1 | 0.1% | |
| -83.33333333 | 1 | 0.1% | |
| -62.5 | 1 | 0.1% | |
| -60 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 216.6666667 | 1 | 0.1% | |
| 128.5714286 | 1 | 0.1% | |
| 100 | 3 | 0.1% | |
| 84.21052632 | 1 | 0.1% | |
| 57.14285714 | 1 | 0.1% |
tx_crescimento_24meses
Numeric
| Distinct count | 95 |
|---|---|
| Unique (%) | 4.8% |
| Missing (%) | 82.3% |
| Missing (n) | 1646 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -13.95242578 |
|---|---|
| Minimum | -100 |
| Maximum | 600 |
| Zeros (%) | 6.6% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -100 |
| Q1 | -42.26190476 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 68.68852459 |
| Maximum | 600 |
| Range | 700 |
| Interquartile range | 42.26190476 |
Descriptive statistics
| Standard deviation | 64.88506624 |
|---|---|
| Coef of variation | -4.650450557 |
| Kurtosis | 28.54370355 |
| Mean | -13.95242578 |
| MAD | 37.59206522 |
| Skewness | 3.769340498 |
| Sum | -4939.158727 |
| Variance | 4210.071821 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 131 | 6.6% | |
| -100 | 39 | 1.9% | |
| -50 | 17 | 0.9% | |
| -25 | 12 | 0.6% | |
| -33.33333333 | 11 | 0.5% | |
| 100 | 8 | 0.4% | |
| -20 | 8 | 0.4% | |
| -66.66666667 | 7 | 0.4% | |
| -40 | 6 | 0.3% | |
| 200 | 4 | 0.2% | |
| Other values (84) | 111 | 5.5% | |
| (Missing) | 1646 | 82.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 39 | 1.9% | |
| -94.73684211 | 1 | 0.1% | |
| -90.90909091 | 1 | 0.1% | |
| -87.5 | 1 | 0.1% | |
| -85.71428571 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 600 | 1 | 0.1% | |
| 400 | 1 | 0.1% | |
| 300 | 1 | 0.1% | |
| 200 | 4 | 0.2% | |
| 125 | 1 | 0.1% |
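Several rows in the tx_crescimento_24meses tables are derived from other rows by simple arithmetic. The sketch below recomputes them from the values shown above; the variable names are illustrative.

```python
# Values copied from the tx_crescimento_24meses tables above.
minimum, maximum = -100.0, 600.0
q1, q3 = -42.26190476, 0.0
mean, std = -13.95242578, 64.88506624

value_range = maximum - minimum    # "Range" row
iqr = q3 - q1                      # "Interquartile range" row
coef_of_variation = std / mean     # "Coef of variation" row

print(value_range, iqr, coef_of_variation)
```

The coefficient of variation is negative only because the mean is negative; its magnitude, not its sign, describes the relative spread.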
tx_rotatividade
Numeric
| Distinct count | 69 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8.127274797 |
|---|---|
| Minimum | 0 |
| Maximum | 200 |
| Zeros (%) | 18.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 40 |
| Maximum | 200 |
| Range | 200 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 22.02773902 |
|---|---|
| Coef of variation | 2.710347512 |
| Kurtosis | 29.0538715 |
| Mean | 8.127274797 |
| MAD | 12.69423094 |
| Skewness | 4.662630838 |
| Sum | 3795.43733 |
| Variance | 485.2212865 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 362 | 18.1% | |
| 33.33333333 | 9 | 0.4% | |
| 28.57142857 | 8 | 0.4% | |
| 40 | 7 | 0.4% | |
| 25 | 4 | 0.2% | |
| 15.38461538 | 2 | 0.1% | |
| 16.66666667 | 2 | 0.1% | |
| 23.52941176 | 2 | 0.1% | |
| 11.76470588 | 2 | 0.1% | |
| 90.90909091 | 2 | 0.1% | |
| Other values (58) | 67 | 3.4% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 362 | 18.1% | |
| 2.941176471 | 1 | 0.1% | |
| 4 | 1 | 0.1% | |
| 4.651162791 | 1 | 0.1% | |
| 5 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 200 | 1 | 0.1% | |
| 190.9836066 | 1 | 0.1% | |
| 150 | 1 | 0.1% | |
| 133.3333333 | 1 | 0.1% | |
| 90.90909091 | 2 | 0.1% |
vl_faturamento_estimado_aux
Highly correlated
This variable is highly correlated with total and should be ignored for analysis
| Correlation | 0.9849750322 |
|---|
vl_faturamento_estimado_grupo_aux
Highly correlated
This variable is highly correlated with qt_socios_st_suspensa and should be ignored for analysis
| Correlation | 0.9487115171 |
|---|
vl_folha_coligados
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 0.9470885171 |
|---|
vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with vl_folha_coligados and should be ignored for analysis
| Correlation | 0.9239805026 |
|---|
vl_frota
Numeric
| Distinct count | 112 |
|---|---|
| Unique (%) | 5.6% |
| Missing (%) | 94.2% |
| Missing (n) | 1885 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 101608.5478 |
|---|---|
| Minimum | 1680 |
| Maximum | 783042 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1680 |
|---|---|
| 5-th percentile | 4225.6 |
| Q1 | 29530 |
| Median | 57476 |
| Q3 | 106890 |
| 95-th percentile | 354267.6 |
| Maximum | 783042 |
| Range | 781362 |
| Interquartile range | 77360 |
Descriptive statistics
| Standard deviation | 134085.1766 |
|---|---|
| Coef of variation | 1.319624967 |
| Kurtosis | 9.34406756 |
| Mean | 101608.5478 |
| MAD | 86045.55599 |
| Skewness | 2.870761613 |
| Sum | 11684983 |
| Variance | 1.797883459e+10 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 72838 | 2 | 0.1% | |
| 98819 | 2 | 0.1% | |
| 39289 | 2 | 0.1% | |
| 41298 | 2 | 0.1% | |
| 76140 | 1 | 0.1% | |
| 8069 | 1 | 0.1% | |
| 169124 | 1 | 0.1% | |
| 241833 | 1 | 0.1% | |
| 71556 | 1 | 0.1% | |
| 783042 | 1 | 0.1% | |
| Other values (101) | 101 | 5.1% | |
| (Missing) | 1885 | 94.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1680 | 1 | 0.1% | |
| 2429 | 1 | 0.1% | |
| 3306 | 1 | 0.1% | |
| 3375 | 1 | 0.1% | |
| 3392 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 783042 | 1 | 0.1% | |
| 656552 | 1 | 0.1% | |
| 561218 | 1 | 0.1% | |
| 537698 | 1 | 0.1% | |
| 481226 | 1 | 0.1% |
vl_idade_maxima_socios_pj
Numeric
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 99.1% |
| Missing (n) | 1981 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 10.94362189 |
|---|---|
| Minimum | 1.314168378 |
| Maximum | 30.06433949 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.314168378 |
|---|---|
| 5-th percentile | 3.987679671 |
| Q1 | 6.977412731 |
| Median | 9.87816564 |
| Q3 | 12.55715264 |
| 95-th percentile | 24.01013005 |
| Maximum | 30.06433949 |
| Range | 28.75017112 |
| Interquartile range | 5.579739904 |
Descriptive statistics
| Standard deviation | 6.800181946 |
|---|---|
| Coef of variation | 0.621383123 |
| Kurtosis | 2.679513486 |
| Mean | 10.94362189 |
| MAD | 4.756913358 |
| Skewness | 1.463400182 |
| Sum | 207.9288159 |
| Variance | 46.2424745 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 9.87816564 | 2 | 0.1% | |
| 23.33744011 | 1 | 0.1% | |
| 6.119096509 | 1 | 0.1% | |
| 15.34291581 | 1 | 0.1% | |
| 4.594113621 | 1 | 0.1% | |
| 30.06433949 | 1 | 0.1% | |
| 13.3744011 | 1 | 0.1% | |
| 5.837097878 | 1 | 0.1% | |
| 8.884325804 | 1 | 0.1% | |
| 8.265571526 | 1 | 0.1% | |
| Other values (8) | 8 | 0.4% | |
| (Missing) | 1981 | 99.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.314168378 | 1 | 0.1% | |
| 4.284736482 | 1 | 0.1% | |
| 4.594113621 | 1 | 0.1% | |
| 5.837097878 | 1 | 0.1% | |
| 6.119096509 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 30.06433949 | 1 | 0.1% | |
| 23.33744011 | 1 | 0.1% | |
| 16.57494867 | 1 | 0.1% | |
| 15.34291581 | 1 | 0.1% | |
| 13.3744011 | 1 | 0.1% |
vl_idade_media_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_maxima_socios_pj and should be ignored for analysis
| Correlation | 0.9804465153 |
|---|
vl_idade_minima_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_media_socios_pj and should be ignored for analysis
| Correlation | 0.9810183741 |
|---|
vl_potenc_cons_oleo_gas
Highly correlated
This variable is highly correlated with vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 0.9999548647 |
|---|
vl_total_tancagem
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_tancagem_grupo
Highly correlated
This variable is highly correlated with vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_veiculos_antt
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_veiculos_antt_grupo
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
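The variables reported as Constant with value nan (vl_total_tancagem, vl_total_veiculos_antt, vl_total_veiculos_antt_grupo) contain no observed values at all. A minimal sketch for finding such columns with pandas follows. Counting NaN as a distinct value (`dropna=False`) appears consistent with the distinct counts shown in this report, where a column holding only 0 and missing values is listed with 2 distinct values, but that reading is an inference. The toy frame and column names are made up.

```python
import numpy as np
import pandas as pd

# Toy frame, not taken from the profiled dataset.
toy = pd.DataFrame({
    "all_nan": [np.nan] * 4,                  # constant "nan"
    "flag": [False] * 4,                      # constant False
    "partly": [0.0, np.nan, np.nan, np.nan],  # 0 plus missing: 2 distinct values
    "x": [1, 2, 3, 4],
})

# A column is constant when it has at most one distinct value, NaN included.
constant_cols = [c for c in toy.columns if toy[c].nunique(dropna=False) <= 1]
print(constant_cols)
```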
vl_total_veiculos_leves
Numeric
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 93.2% |
| Missing (n) | 1864 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.610294118 |
|---|---|
| Minimum | 0 |
| Maximum | 24 |
| Zeros (%) | 1.6% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.602485405 |
|---|---|
| Coef of variation | 1.61615532 |
| Kurtosis | 41.78943292 |
| Mean | 1.610294118 |
| MAD | 1.350129758 |
| Skewness | 5.544720745 |
| Sum | 219 |
| Variance | 6.772930283 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 66 | 3.3% | |
| 0 | 32 | 1.6% | |
| 2 | 18 | 0.9% | |
| 3 | 7 | 0.4% | |
| 4 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 11 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 24 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| Other values (2) | 2 | 0.1% | |
| (Missing) | 1864 | 93.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 32 | 1.6% | |
| 1 | 66 | 3.3% | |
| 2 | 18 | 0.9% | |
| 3 | 7 | 0.4% | |
| 4 | 4 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 24 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 7 | 1 | 0.1% |
vl_total_veiculos_leves_grupo
Numeric
| Distinct count | 35 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 21.32679739 |
|---|---|
| Minimum | 0 |
| Maximum | 35064 |
| Zeros (%) | 91.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 35064 |
| Range | 35064 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 788.3362551 |
|---|---|
| Coef of variation | 36.96458689 |
| Kurtosis | 1966.891524 |
| Mean | 21.32679739 |
| MAD | 41.80239684 |
| Skewness | 44.23709965 |
| Sum | 42419 |
| Variance | 621474.0511 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1834 | 91.7% | |
| 1 | 71 | 3.5% | |
| 2 | 26 | 1.3% | |
| 3 | 9 | 0.4% | |
| 4 | 7 | 0.4% | |
| 8 | 5 | 0.2% | |
| 5 | 4 | 0.2% | |
| 88 | 2 | 0.1% | |
| 6 | 2 | 0.1% | |
| 18 | 2 | 0.1% | |
| Other values (24) | 27 | 1.4% | |
| (Missing) | 11 | 0.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1834 | 91.7% | |
| 1 | 71 | 3.5% | |
| 2 | 26 | 1.3% | |
| 3 | 9 | 0.4% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 35064 | 1 | 0.1% | |
| 2134 | 1 | 0.1% | |
| 888 | 1 | 0.1% | |
| 782 | 1 | 0.1% | |
| 479 | 1 | 0.1% |
vl_total_veiculos_pesados
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.9118619239 |
|---|
vl_total_veiculos_pesados_grupo
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.909344757 |
|---|